Resume of Experienced Data Scientist
Resume of Experienced Data Scientist
Plot No. 341, Flat No.102, Sector-4, Ghansoli, Navi-Mumbai, 400701, India
+917974697771
[email protected]
http://shubhamsharmaportfolio.com
https://www.linkedin.com/in/shubham-sharma-8889893109/
https://stackoverflow.com/users/4786793/shubham-sharma
https://www.kaggle.com/shubhamcagel
Career Objective:
Experienced Data Scientist working as High Performing asset of the organization along with Strategic-thinking and
Multitasking technologists with experience in Python,R,Machine Learning,Data Science,Data Analytics, Production ready
Machine learning and predictive model deployment, Functional and Architectural Engineering,Smart Contracts,Solidity.
Academic Records:
Professional Qualifications:
Pursued Bachelor of Engineering from Acropolis Institute of Technology and Research, Indore Affiliated to RGTU with Specialization in
Computer Science & Engineering (2013-2017) (CGPA-80.03%)
Educational Qualification:
Senior Secondary School Certificate (10+2) from Board of Secondary Education, Bhopal M.P. with 83% in the year 2013
High School Certificate (10th) from Board of Secondary Education, Bhopal M.P. with 91.3% in the year 2010-11
IT Skills:
Language : C, C++, Java, Python(Data Science , Data Analytics,Web), R
Data Analysis : Python-statsmodel , Python-ScikitLearn , Pandas , Anaconda
Data Visualization : Python-Orange3 , Matplotlib , Seaborn , Spotfire , Zoomdata
Data Mining : Python-Requests , Scrapy, Beautifulsoup
Data Statistics : Descriptive Analysis,Probability,Distributions,Hypothesis Testing,Chi square test,T test,Z test,ANOVA
Machine Learning : TensarFlow , Spark-MLlib , Python-Scipy , Numpy,NLP,OpenCV,WordtoVec
Cloud Development : Google Kubernetes engine, Microsoft Azure,ML Pipelines
Data Engineering : Pyspark,Spark-Mllib,Spark-Streaming ,Spark-Sql,Hadoop,Hive
SQL Database : Mysql ,Sql Server , Postgres
NOSQL Database : MongoDB
Full Stack Development : HTML 5, Django,Flask
Operating System : MS DOS, Windows xp/7, Ubuntu 14.04 LTS, Ubantu 16.04 LTS
Developer Community : Github, Kaggle, Stack-Overflow, Hacker-Rank, Code-chef, Hacker-Earth, Stack-Exchange
Experience:
I had worked as Lead Developer(Intern) for MUVR technology from November 2015 to June 2017.
I had worked Analytics Engineer in Certainty InfoTech Private Limited from June 2017 to June 2018.
I am working as Data Scientist (Manager Permanent Role) in Reliance Industries Private Limited from June 2018 to till now.
Training and Projects:
Machine Learning :
Organization : Stanford University Machine Learning by Andrew NG
Description : Done Projects of Machine Learning
Deep Learning :
Organization : Stanford University Deep Learning by Andrew NG
Description : Done Projects of Deep Learning
Google Cloud :
Organization : Google Cloud
Description : Course were focused on Machine learning with Tensor flow using GCP
Python Development:
Organization : Mukul World Institute, Patna
Description : We had Worked on advanced Python Data Analytics Frameworks like Pandas,TensarFlow,Spark
Live Projects under Current Job Period with Reliance Industries Private Limited:
Data Science Related Projects:
Data Science and Analytics Projects:
Title : Jio Mart Grosery product recommendations
Role : I worked as Data Scientist, Statistical and Programming Model Developer
Data Science : Rank Based , user-user similarity-based , item-item similarity Recommendation, GridSearchCV
Related Work : It is used by the Retail team to recommend products to customers based on their previous
ratings for other products.
Deployment : Deployed model on Microsoft Azure , used Azure devops,ngnix,Kubernates etc
Tittle : Customer churn analysis for Jio customers
Role : I worked as Data Science Engineer,Statistical and Programming Model Developer
Programming Libraries: Python-Sklearn , Pandas , Scipy , NLP,Keras , Tensorflow , Numpy
Deployment : Deployed model on Microsoft Azure , used Azure devops,ngnix,Kubernates etc
Tittle : Audit Fleet Card Misutilization Analytics for Fleet Card owners
Role : I worked as Data Scientist as well as Audit Analytics developer and facilitator for project
Data Science : Data ingestion,Data modeling,Data analytics,Visualization,Data Lake
Programming Libraries: Python-Sklearn , Pandas , Scipy , Numpy, PySpark
Deployment : Deployed model on Google Cloud used GKE, DataProc
Title : Tool for any kind of PDF content extraction as well as searching
Role : I workedas Data Scientist, Pattern Recognition,Regular Expression,NLP,Extractive text summarization
Data Science : Extractive text summarization, OCR , OpenCV
Programming Libraries: Python-Sklearn , Pandas , Scipy , Numpy, Word2vec,NLTK,re,regex,opencv,tesseract,Mongodb
Deployment : Deployed model on Microsoft Azure , used Azure devops,ngnix,Kubernates etc
Tittle : Created and analyzed live and historical Bidding and Offer Data of Reliance Petrolium
Role : I had worked as Data Scientist, Statistical and Programming Model Developer and Python Developer
Data Science : Statistics ,Data munging ,Data Cleansing, Variable Selection, Hypothesis testing , Predictions , Model building
Filteration,aggregation,Data manipulation
Related Technologies : Websocket technology and S&P Global plats api
Programming Libraries: Python-Sklearn , Pandas , Scipy , Numpy , Pyboruta , Crossvalidations, Pivottablejs,BeautifulSoup,Multithreading,
Matplotlib,Plotty
Deployment : Deployed model on Google Cloud used GKE,Vertex AI
Tittle : Created Oil and Gas Price Prediction Models for RNM(R&D)Team Reliance
Role : I had worked as Data Scientist, Statistical and Programming Model Developer and SparkR Developer
Data Science : Statistics ,Data munging ,Data Cleansing, Variable Selection, Hypothesis testing , Predictions , Model building
Related Technologies : Chemical Engineering Related , Oil Viscosity, Least Angle Regression , Gradient Descent
Programming Libraries: R , SparkR,GGplot,Zeppelin,Zoomdata
Deployment : Deployed model on Google Cloud used GKE,Vertex AI
Tittle : CostaRica Travel Agency Customer conversion analysis and improvement in customer conversion rate
Role : I had worked as Data Science Engineer,Statistical and Programming Model Developer
Data Science : Statistics ,Data munging ,Data Cleansing, Variable Selection, Hypothesis testing , Predictions , Model building
Django Related Work : Created Django Backend of same project Using Pydev(Eclipse) and putted programming model in that
DataBase : SQL Server
Hosting : Hosting is done on Linux instance with Apache and Mod_WSGI
Programming Libraries: Python-Sklearn , Pandas , Scipy , Numpy , Pyboruta , Crossvalidations, django,django-mssql,django-azure,pyodbc
Duration : Duration of project was 50 days
Tittle : I had worked on Image Analysis plugin of Orange3 for Hungary Client
Role : I had worked as Data Science Engineer,Statistical and Programming Model Developer and Python Developer
Data Visualization : Microsoft-Excel, Orange3 Software
Data Science : Data Prepration, Data Science Model Development,Optimization of Machine Learning model , Predictions
Programming Libraries: Anaconda distribution,Pip , Conda
Duration : Duration of project was 5 days
Tittle : Analysis of sales Data and automation of the same for San Diego client
Role : I had worked as Data Science Engineer,Statistical and Programming Model Developer and Python Developer
Data Science : Data Prepration, Data Science Model Development,Optimization of Machine Learning model , Predictions
UI Related Work : Created TKinter (library in python language) based user interface for the same
Input or backend : It was having Excel file with data sheets for Sales,Customer,Items data
Programming Libraries: Python-Sklearn , Pandas , Scipy , Numpy , scipy.norms, Matplotlib, , Pyboruta , Crossvalidations,Python-Tk
Duration : Duration of project was 15 days
Tittle : Tibco Spotfire Visualization Project on Empirical-Forecast-WF-20170607 – Spotfire.Docx for Brazil Client
Role : I had worked as Data Science Engineer,Statistical and Programming Model Developer and Python Developer
Data Science : Tibco Spotfire,TERR,R,Linear Regression,Logistic Regression,Decision Tree Regression
Programming Libraries: Python-Sklearn , Pandas , Scipy , Numpy, Matplotlib, Statsmodels
Duration : Duration of project was 20 days
Tittle : MUVR application , it provides travelling services at your doorstep in Gujarat. It has 10+ installs
Role : I worked as Data Science Intern and as a Team Leader
play Store link : https://play.google.com/store/apps/details?id=com.muvr.partner
Data Science : Linear Classification,Logistic Classification,Decision Tree Classification,SVM,KNN,Keras
Client : It is product of MUVR company in which I worked
Tittle : Samaac Application , it provides all kind of home services at your door step. It has 100+ installs
Role : I worked as Data Science Intern , Worked on finding common pattern of customers
play Store link :https://play.google.com/store/apps/details?id=com.ionicframework.samaac196793
Data Science : QDA,Decision Tree,Random Forest,SVM,KNN,Keras
Client : Jabalpur Company Samaac Technologies http://www.samaac.com/
Tittle :Newspaperwala application ,used for monthly management of all newspaper venders
Role : I worked as Data Science Intern
play Store link : https://play.google.com/store/apps/details?id=com.technotwit.newspaperwala
Data Science : Linear Classification, QDA,Decision Tree,Random Forest,SVM,KNN,Keras
Client : Maharashtra Company Techno twit Solution http://techno-twit.com/
Tittle : Automation Project in Python for blog backend for USA client
Role : Python-Developer
Description : I had automated overall project to push posts, retrieve and also solved client problem
Tittle : I had created chess Game with Advanced Data Structure as Stack , Queue , Artificial intelligence in Python
Role : Python-Developer, Researcher
Project Link : https://groups.google.com/forum/#!topic/aitr_cs_2011/LohDxCILEvE
Description : I had created chess Game with Advanced Data Structure as Stack , Queue , Artificial intelligence in Python
Java Projects:
Tittle : It is car pooling system project
Role : I had worked as Hibernate Java Developer with Designing of whole Software
Project Link :https://github.com/shubhamsharmacs/CarPoolingSystem
Database : Mysql , JFrame , Object-Relational-Mapping
Attended Conferences:
Organization : Google.Inc in association with Maharashtra Institute of Technology, Aurangabad.
Tittle : I was invited as Chief Guest for MLCC Study Jam Grand Event .
Delivered Lecture On : I delivered lecture on “Data Science in AI Neural Networks with Tensorflow”.
Research Paper:
Presented Research paper on – “Search engine optimization with page rank algorithm " at "JNU Delhi" with reference number
ESM286 and it is available here :- http://www.academicscience.co.in/admin/resources/project/paper/f201509231442999506.pdf
Presented Research paper on “Introduction to Map Reduce” in IJCAM Conference at,“Chameli Devi Group of Institute Indore” in
”2016” which is available here :-http://www.ijcam.com/paperdownload.php?filename=Introduction__To_MapRedcuce.pdf
Presented Research paper on “Big Data Analytics Using Apache Spark On IOT” in National Conference on Contemporary Computing
at,“Chameli Devi Group of Institute Indore” in ”2017”
Extracurricular Activities:
Awarded with title of Vidhya Bhusan by Amul Company in year 2012-13.
Selected as Microsoft Student Partner in the session 2015-16.
Cleared JEE-Main in 2013-14.
st
Secured 1 prize in inter college chess competition in year 2014.
Secured 2nd prize in inter college chess competition in year 2017 as a caption.
Selected as campus ambassador for event “Voodoo or Die” contest held by “Voodoo incorporation” In 2015.
st
Secured 1 rank in Innovative Idea Competition in 2015 at tech fest at college level.
Selected in National Talent search examination among top-100 contestants by NICT.
Secured 1st rank in Science Exhibition Competition in 2013.
Selected for Microsoft Gift Voucher in April 2017 among 100 to 150 candidates.
Worked as Trainer for ROR in Acropolis Institute from 09 March 2017 to 03 April 2017.
Technical Achievements:
I had achieved total of 63 different badges in which 4 golden badge, 20 silver badges and 39 are bronze badge which are very rare for
anyone to achieve till 18 August 2019. My reputation is 1960 till 18 August 2019 at StackOverflow.com.
Successfully completed “Machine Learning by Andrew NG” Cloudera course by submitting 25 Data Science Assignments to Stanford
University.
Successfully completed “Neural Networks and Deep Learning Specializations by Andrew NG” Cloudera course by submitting 8 Data
Science Assignments to Stanford University.
Successfully completed “Google Data Analytics Professional Certificate by Google” Cloudera certification by submitting 8 Data
Science assignments.
Successfully completed “Google Cloud Big Data and Machine Learning Fundamentals” Coursera course by Google Cloud .
Successfully completed “MLOPS Fundamentals on GCP” Coursera course by Google Cloud .
Successfully completed “End to End Machine Learning with Tensorflow on GCP” Coursera course by Google Cloud .
Successfully completed “The Data Scientist Toolbox” Coursera course by John Hopkins University.
Successfully completed “Guided Tour of Machine Learning in Finance” Coursera course by New York University .
Successfully completed “IBM Blockchain Foundation for Developers” Coursera course by IBM .
Successfully completed “Applied Text Mining in Python” Coursera course by University of Michigan.
Successfully completed “Mathematics for Machine Learning Specialization” Coursera course by Imperial College London.
Successfully completed “Applied Machine Learning in Python” Coursera course by University of Michigan .
Successfully completed “Convolutional Neural Networks” Coursera course by DeepLearning.AI.
Successfully completed “Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization” Coursera
course by DeepLearning.AI.
Successfully completed “Neural Networks and Deep Learning” Coursera course by DeepLearning.AI.
Pursuing “IBM Data Science Professional Certificate” Coursera course by IBM.
Pursuing “IBM Data Engineering Professional Certificate” Coursera course by IBM.
Successfully completed “Microsoft Technical Associate 98-364 Database Fundamentals”.
Successfully completed Google course “Applied CS With Android”by submitting 4 Android projects to Google.
Successfully completed “JavaScript Fundamentals for Absolute Beginners” by Microsoft Virtual Academy.
Successfully completed “Introduction to Programming with Python” by Microsoft Virtual Academy.
Successfully completed “M101P: MongoDB for Developers” by MongoDB University.
Successfully completed “Statics with R” course by Udemy University.
Successfully completed “Python Course Season 1” by Mukul World.
Successfully completed “Scala Programming For Beginners Complete Guide” by Udemy University .
Successfully completed “Pig For Wrangling Big Data” by Udemy University .
Successfully completed “Android App Developmement with Parse” course by Udemy University .
Successfully completed “React VR Creating Virtual Reality Apps” course by Udemy University .
Successfully completed “Get your website on web” course by Udemy University.
Successfully completed “Modern Web Design using HTML5 and CSS3” course by Udemy University.
Successfully completed “Angular 2 With TypeScript” course by Udemy University.
Successfully completed “Networking for Amazon AWS” course by Udemy University.
Successfully completed “Introduction to C and C++” by Cuboid Institute.
Selected as intern by Persistent company for annual project competition.
Successfully attended workshop of Amazon Tech Essential by Amazon India in duration April 2017
Successfully completed 6 month internship project for Persistent Company in duration 2016-17
Strengths: Consistency, Dedication, Love to coding, Keen to Statistics, Hunger for Data Related Problems, Team Leadership.
Hobbies:
Kaggler, Programming, Research, Puzzles and Solving Development related Issues of other Developers on Stack Overflow, Github, Chess
Personal Details:
Date of Birth : 12/12/1995
Gender : Male
Nationality : Indian
Marital Status : Unmarried
Languages known : Hindi, English
Mother Tongue : Hindi
Father’s Name : Mr. Virendra Sharma
References:
Mr.Anujay Saraf Sr. AI-Data Scientist Sapient ,India.
Ms. Anshul Kamboj Manager at Wipro ,India .
Declaration:
I hereby declare that the information given above is true to the best of my knowledge & belief