Data Scientist Resume Template
► Associate, Big Data & EDW, at Celebal Technologies since January 2022
► Worked as a Research Fellow at Tata Institute of Fundamental Research
► Experience teaching engineering students
► Experience in Python, machine learning, Big Data, and PySpark
► Experience in the Databricks ecosystem, that is, Spark Core and Spark SQL

► Databricks (PySpark)
    Spark architecture
    Analyzing datasets and extracting the required data
    Spark SQL
    Delta Lake
► Data science
    Completed 3 short data science projects using Python
    Used various machine learning algorithms in the projects, including K-Nearest Neighbors, Naïve Bayes, Logistic Regression, Decision Tree, Random Forest, Discriminant Analysis, XGB, and Artificial Neural Networks
► SQL
    Solved various HackerRank problems
► Python
    Used Python in data science as well as pure science projects
    Data science projects involved libraries such as NumPy, SciPy, pandas, scikit-learn, TensorFlow, XGB, etc.
    Pure science projects involved libraries such as NumPy, SciPy, matplotlib, etc.
► Databricks partner course and Capstone badges
    Received the Databricks certification of Associate Developer for Apache Spark 3.0
    Received the Databricks certification of Associate Data Engineer
    Received the Developer Foundations badge
    Received the Developer Essentials badge
    Received the Partner Solutions Architect Essentials badge
    The badges involved working on projects that simulated real-life scenarios, for example, movie store data held in a data lake that is growing exponentially. These projects also involved implementing a multi-hop Delta Lake architecture using Spark Structured Streaming.
    Performed join and union operations
► Spark Structured Streaming
    Implemented and used DataFrames in Spark with Python and the Spark SQL API
    Performed extraction and analytics queries on streamed data using Spark with Python
► Big Data
    Worked on determining the ideal schema
    Handling of corrupt records and fixing them
    Flattening of corrupted records
    Apache Spark functions

Skills
► Big Data
► Data science and machine learning
► Databricks (PySpark)
► Python
► SQL
► Linux
► Power BI
► Java (core)
► Scientific computing using C and Fortran
► Scientific reporting using matplotlib

Research Publications
► Published 3 papers in the field of cosmology in reputed international journals
► The work involved scientific computing, simulation and analysis of data, and scientific reporting

Education
► B. Tech. in Engineering Physics from IITB (year 2001)
► MS in Physics from MIT, USA (year 2004-05)

Objective
My experience in data analysis and data analytics while working on science projects got me interested in Big Data Analytics, and that is what I plan to pursue in the future.