Harsha Data Scientist 6 Years
Harsha Data Scientist 6 Years
About
Specialized Data Scientist with a demonstrated experience of 6 years into Machine Learning, Statistical Modelling, Data Mining ,Deep Learning , NLP , Python, Spark ,SQL &
Tableau . Skilled in Data Science process like hypothesis generation, exploratory data analysis, data cleaning, model building, delivering data science pipelines on cloud platforms with
CI/CD pipelines , measure impact of applied models using A/B Testing , results interpretation, and implementation. Good working knowledge on forecasting models, Clustering
analysis/segmentation and recommendation engines.
Skills
Programming Language : Python, Microsoft SQL Server , Hadoop,Java, Spark, Hive, Big Data, Mongo DB, GIT , React and Jira .
Python Packages/IDE’s : Scikit-learn, NLTK, spaCy, Keras, Jupyter, Pandas, NumPy, PyTorch, Matplotlib, OpenCV, imutils, Seaborn, streamlit, pyhive,
SHAP & LIME ,Keras, TensorFlow, TensorFlow Object Detection, ImageAI, Cognitive Services
Cloud Applications : Azure App Services, Azure ML, ML Services, Synapse ,Micro services, Containers, Functions, Azure Data Bricks, Cloud Storage
(BLOBS, Data Lakes, etc.) and Cloud Databases, Azure Cognitive Services, Azure PaaS, Web Jobs, Azure API Management
Service, App Services, and virtualization.
Classification and Regression : KNN, Naive Bayes, Linear Regression, Logistic Regression, SVM, Decision Tree,PCA, Random Forest , XGBOOST, LighGBM,
CATBOOST and Ensemble Models
Time Series and Forecasting : Moving Averages, ARIMA, Facebook Prophet, Exponential Smoothing , Holt’s Winter seasonal Approach.
Text Analytics(NLP) : BOW, TF-IDF, Word2Vec , Doc2Vec, Transformers,Stemming, Lemmatization, Data Cleaning
Work History
Senior Data Scientist Fragma Data Systems, Dubai, UAE [June 2019 to Present]
Predictive Maintenance:
Designed the predictive maintenance solution to estimate the RUL of a machine using the IOT/sensor data of machine equipment’s.
Created maintenance and breakdown cycles of a machine by crunching the IOT data of 2 years to derive the RUL.
Developed Ensembled XGBOOST + Linear Regressor models to predict the RUL of a machine.
Deployed the solution using Azure ML, ML Services with Azure BLOB storage and Functions . Used Containers for the
deployment. Used Azure Web jobs for data streaming jobs from IOT devices.
Certifications
Sep 2020 Microsoft certified Data Science Associate