Data Science Roadmap 2025
Data Science Roadmap 2025
If you feel like you need more support, you can book a 1:1 call with me on topmate.io.
And don't forget to check out my YouTube channel for more Data Science education
and motivation.
Also, no pressure at all, but if you nd this guide helpful and feel so inclined, you can
click here to buy me a coffee.
Should You
Still Become
a Data
Scientist?
Right away, let’s address a question you
might be worried about: Is it even worth it
to become a Data Scientist now? With all
the layoffs in tech and the increasing
presence of AI, it’s natural to question
whether data science is still a smart career
choice.
Here’s why I believe that it is: First, while
tech layoffs have absolutely been real, they
don’t tell the whole story. Even with recent
cutbacks, data science remains one of
the most in-demand elds. According to
the U.S. Bureau of Labor Statistics,
employment of Data Scientists is
projected to grow 36% from 2023 to 2033,
which is much faster than most other
professions.
And what about AI? It’s true that AI is
reshaping industries, but it’s not
eliminating Data Scientist roles – at least
not yet. More importantly, the core skills
you’ll develop—like programming,
statistics, and critical thinking—are
valuable across many industries, even if
you do need to pivot in the future.
So, while it might be harder to break into
data science now compared to a few years
ago, the rewards are absolutely still there
for those who are persistent and focused.
If you’re willing to put in the effort, this
path can lead to a well-paying,
intellectually rewarding career with plenty
of opportunities for growth.
In This Roadmap
Timeline
Area of Focus
What to Learn
Things NOT to Learn
Marketing Your Experience
Interview Prep
How to Do Well When You Start Your First Job
Timeline
You’ve probably seen videos or courses promising you can become a Data Scientist in
just a few weeks or a few months.
While that might be realistic for someone with a super strong technical background
and a lot of free time, for most beginners, it’s not going to be feasible.
Data science isn’t easy. It’s not a job where you just learn a few simple things and
then coast. Building the necessary skills takes time, but it’s worth the investment.
This process is de nitely a marathon, not a sprint.
That being said, if you can dedicate about 20 hours a week to learning, this 12-
month timeline will guide you step-by-step from beginner to that rst job.
So if you’re starting in January, this timeline ts perfectly. If not, don’t worry, just use it
as a exible template. Let’s break it down.
January to
June
(Weeks 1-
26)
Building the Basics
The rst six months are all about establishing a strong foundation.
During this phase, you’ll learn the essential skills every Data Scientist
needs, while also starting to build your network.
Step 1 Join the Data Science
Community
First things rst, we’re going to start meeting people. Find local
meetups, Discord servers, or LinkedIn communities where Data
Scientists connect.
Building relationships early will help you down the road, whether it’s
nding mentors, collaborators, or even job opportunities. Plus, you’ll
start learning the language of the eld and getting familiar with the
kinds of things people are working on, which will be helpful when
you’re coming up with projects and preparing for interviews.
Pros
• These roles often have a strong business impact and can be
rewarding if you like problem-solving in that context.
• Typically have a shorter learning curve for beginners since they
rely more on data wrangling and basic statistical skills.
• Great foundation if you’re eventually interested in a broader data
science role.
Cons
• Can be less technically challenging. If you’re looking to dive into
machine learning or advanced statistical modeling, this may not
be totally satisfying for you.
• Compensation can be slightly lower than in more specialized
data science or ML-focused roles.
2. Data Science Generalist
Like the name implies, data science generalist roles require a mix of
skills, from data wrangling and statistics to building predictive
models. If you enjoy variety and exibility, this path allows you to
work across the entire data science pipeline—from exploratory data
analysis to deploying machine learning models.
Typical Tasks
• Cleaning and analyzing data to uncover patterns.
• Building and testing predictive models.
• Communicating ndings through visualizations and reports.
Pros
• Offers a wide range of learning opportunities since you’ll touch
every aspect of the data science work ow.
• Opens up multiple career paths, allowing you to specialize later
on in analytics, statistical modeling, or machine learning.
• Often requires less technical depth in math and ML but still gives
you a solid, broad foundation.
Cons
• You’ll need to be exible and comfortable working on different
types of projects.
• The breadth of skills required can feel overwhelming at times,
especially if you’re just starting.
3. Machine Learning-Focused
If the technical and engineering side of data science sounds fun to
you, an ML-focused role might be the best t. These roles often
require deeper programming, math, and algorithmic knowledge,
making them more challenging in some ways, but also (in my
opinion) really fun.
These roles are more likely to allow you to have solo deep work time
and are generally less collaborative than generalist or analytics roles.
Typical Tasks
• Developing machine learning models.
• Deploying models to production systems.
• Potentially working on advanced algorithms for computer vision,
natural language processing, or applications like recommender
systems.
Pros
• Highly specialized skill set that’s in demand, especially in tech-
driven companies, and especially right now.
• Often offers higher compensation due to the technical
complexity and lack of talent to ll open roles.
• Opportunity to work at the cutting edge of technology.
Cons
• Steeper learning curve, especially for those without a technical
background.
• Requires more engineering-type work, which may or may not
appeal to you.
Which One Should
You Choose?
For this roadmap, we’ll assume you’re aiming for a Data Science
Generalist role. It’s a great starting point because it provides
exibility and opens doors to both analytics and machine learning
positions later.
That said, the advice in this roadmap applies across all three paths.
The key difference lies in what you prioritize in your learning:
Timeline
Plan to spend about four weeks mastering the basics of Python that
we went over.
Example Projects
To solidify your learning, here are some example projects to put
things into practice:
Write a program to process words in a text le
• Read in some data.
• Count the frequency of each word and identify the top 10 most
frequent words.
• Plot the top words.
• Save the results into a new le.
Create a basic calculator tool
• Write functions for addition, subtraction, multiplication, and
division.
• Take user input and perform the selected operation.
• Add exception handling to manage invalid inputs or division by
zero.
• Print the results.
Probability & Statistics
At the same time, we’re going to start studying statistics. As a Data
Scientist, you’ll use statistics in nearly every aspect of your work—
from analyzing datasets to evaluating machine learning models, and
even designing experiments. The better your grasp of statistics, the
better your ability to interpret results and not make huge mistakes
that cost the company millions!
Just kidding. Kind of. We really do want to know what we’re talking
about when we make decisions based on data.
Just like with Python, start with the basics and build your
knowledge over time. Here’s a roadmap of some key topics:
1. Descriptive Statistics
• Understand measures of central tendency: mean, median, and
mode.
• Learn measures of dispersion: like variance and standard
deviation.
• Explore common distributions like the normal distribution,
skewed distributions, and uniform distribution.
2. Data Characteristics
• Differentiate between continuous vs. discrete variables and
nominal vs. ordinal data.
• Learn how to identify and handle outliers.
3. Correlation and Multicollinearity
• Understand the difference between correlation and covariance.
• Learn about multicollinearity and its implications for regression
models.
4. Inferential Statistics
• Understand hypothesis testing, including p-values, z-tests, t-tests,
and chi-squared and ANOVA.
• Dive into basic probability concepts, including Bayes’ Theorem
and randomization.
• Understand con dence intervals.
• Learn about sampling techniques, resampling methods, and
common statistical biases.
5. Experimentation Basics
• How to determine sample size.
• How to decide how long to run the test.
• Learn about the importance of randomization in experiments.
Bonus
Explore causal inference techniques to evaluate cause-and-effect
relationships when we can’t run an experiment.
Learning Resources
• This Udemy course is a super gentle introduction (wait for
sales/use coupon codes for Udemy)
• StatQuest (YouTube channel): Fantastic for breaking down
complex topics in an easy-to-understand way.
• Khan Academy: Offers a structured and beginner-friendly
approach to core statistics topics.
• Practical Statistics for Data Scientists book (af liate link): A great
hands-on resource that ties statistical concepts directly to data
science applications in R and Python.
Timeline
Dedicate three-four weeks to studying statistics alongside Python
(so, it’ll take around eight weeks total to learn the basics of both).
Spend the rst week focusing on descriptive statistics and data
characteristics, then move on to inferential statistics and
experimentation in the following weeks.
Example Projects
• Download a Kaggle sales dataset and analyze the distribution of
sales in each region and over time.
• Download a survey dataset from Kaggle and analyze stats for
question answers, nd outliers, and compare answers across
groups (demographic or age etc.)
SQL
Next, we’re introducing another coding language: SQL. No matter where you work,
you’ll almost certainly use SQL to interact with data, because this is the language we
use to query data from a database. While some smaller organizations may rely only
on CSV les, most companies use relational databases to store their data, and SQL is
the primary tool for accessing and manipulating it.
Luckily, it’s pretty easy! Especially if we just focus on the core concepts we need to get
started in data science:
1. Core Querying Skills
• Learn how to write queries using SELECT, WHERE, ORDER BY, and LIMIT.
• Understand how to deal with NULL values effectively to avoid errors or unexpected
results.
• Get comfortable with handling dates and timestamps.
2. Intermediate Features
• Master JOINS: Understand the differences between INNER JOIN, LEFT JOIN, RIGHT
JOIN, and FULL OUTER JOIN.
• Learn to use UNIONs and self-joins for more complex queries.
• Explore CASE statements to add conditional logic to your queries.
• Understand GROUP BY and aggregate functions (e.g., COUNT, AVG, SUM, MIN,
MAX) for summarizing data.
3. Advanced Techniques
• Dive into Common Table Expressions (CTEs) to break down complex queries into
manageable pieces.
• Learn how to write and optimize subqueries.
• Explore window functions like RANK, ROW_NUMBER, and LAG to perform
advanced analytics within a dataset. This will almost de nitely come up in
interviews, so don’t skip it.
4. Query Optimization
Timeline
Dedicate two weeks to learning the basics of SQL.
Example Projects
For SQL, just work through practice problems in the interactive
hands-on learning platforms linked above.
Tableau/Power
BI/Data Visualization
As a Data Scientist, a lot of your job is about telling a story with data.
Data visualization tools like Tableau, Power BI, and Python data
visualization libraries allow you to transform raw data into insights
that non-technical stakeholders can easily understand.
While you don’t need deep expertise in these tools to get hired,
being familiar with them and having them listed on your resume is
useful.
For your rst role, focus on the fundamentals of data visualization,
both with dedicated tools and programming libraries:
1. Basics of Tableau and Power BI
• Connect to a dataset and explore the interface.
• Create basic charts like bar plots, line graphs, and scatter plots.
• Combine visuals into dashboards and add interactivity (e.g., lters
or slicers).
2. Python Visualization Libraries
• Generate static and interactive visualizations for data exploration.
• Customize plots with titles, labels, and legends.
Learning Resources
• Intro to Tableau
• Intro to Power BI
• Intro to Python visualization libraries
Timeline
Spend one week learning the basics of Tableau, Power BI, and Python-based
visualization libraries.
Example Projects
1. Sales Dashboard
3. Movie Reviews
• Repositories (Repos): These are like folders for your project that track all changes to
les.
• Version Control: Learn how Git tracks changes to your les and allows you to move
back to previous versions if needed.
• Branches: Use branches to experiment with new features or ideas without
affecting your main codebase.
• Merging: Combine branches back into the main branch once your changes are
ready.
• Con ict Resolution: Handle situations where multiple people make changes to the
same part of a le.
• How to use the GitHub platform to interact with Git.
Timeline
Spend ½ week learning the basics of Git.
Example Projects
Start by creating a personal project repository and practicing
commands. As you work on projects, use Git consistently to develop
good habits.
Check-in
At this point, you’ve been studying for about 2.5 months. If you started in January,
we’re now around mid-March, and you’ve already built a strong foundation in Python,
Statistics, SQL, and Data Visualization. Here’s the neat part: You now have the skills to
be a Data Analyst.
This doesn’t mean you’ve completed your journey to becoming a Data Scientist, but it
does mean you’re ready to start marketing yourself as someone who’s job-ready now.
So at this point we’re going to take some time to start putting yourself out there in
the job market.
2. Update GitHub
Make sure your GitHub is super clean with all your work so far (we’ll talk more about
this in the Marketing Your Experience section).
Learning Resources
Luckily, the perfect resource for this already exists: Just watch
3blue1brown’s Essence of Linear Algebra and Essence of Calculus
playlists.
If you want to dig deeper after that, I’d recommend:
Timeline
Plan to dedicate two weeks to learning these math fundamentals.
Again, focus on understanding the intuition of what is happening vs.
the calculations themselves. You can always come back to the math
later in your career.
Machine Learning
Now we’re ready to get started with the most fun part, in my opinion. Machine
learning!
At its core, data science is about solving problems using data. Machine learning is one
of the most powerful tools in your toolkit because it enables you to uncover patterns
and make predictions from data.
Before diving into the speci c algorithms, there’s a lot we need to understand rst:
• Data Cleaning: Learn how to x messy data, handle missing values, and deal with
inconsistent entries.
• Feature Engineering: Turn raw data into useful inputs for your model by creating
features.
• Bootstrapping: Understand how to use sampling to estimate the properties of
your data.
• Feature Preparation: Handle common issues like scaling, imputing missing values
(NULLs), and normalizing your features.
• Train-Test Split and Avoiding Data Leakage: Keep your training and testing data
separate to ensure your models generalize well.
• Exploratory Data Analysis (EDA): Know how to summarize, visualize, and explore
data to spot trends and outliers.
• Supervised vs. Unsupervised Learning: Learn the difference between working
with labeled data vs. nding patterns in unlabeled data.
• Classi cation vs. Regression: Understand when to predict categories
(classi cation) vs. continuous numbers (regression).
• Metrics: Master regression metrics (e.g., MAE, RMSE) and classi cation metrics
(e.g., accuracy, precision, recall, F1, and AUC score).
Once you have some of this background, you can get started with some of the
fundamental algorithms:
• Linear regression
• Logistic regression
• Decision Trees
• K-means Clustering
Learning Resources
• Andrew Ng's classic ML Specialization on Coursera (af liate link)
• A Gentle Introduction to Machine Learning StatQuest Playlist
• DataCamp Machine Learning Fundamentals in Python
• Google's Machine Learning Crash Course
Timeline
Expect to spend about four weeks learning the fundamentals.
Example Projects
Work through all of the projects in the courses.
• Over tting: Understand what
it is and how to tackle it.
• Regularization: Learn
techniques like ridge and lasso
regression to keep models
from over tting.
• Cross-Validation: Validate your
models properly by splitting
data into multiple
training/testing sets.
• Principal Component Analysis
(PCA): Use PCA to reduce
dimensionality and focus on
the most informative aspects
Then, we’re ready to move of your data.
on to some more • Hyperparameter Tuning:
advanced concepts to Learn how tweaking model
help us handle real-world settings can boost
challenges. performance.
• Ensemble Models: Get familiar
with random forests and
gradient boosting models (e.g.,
XGBoost).
• Time Series Basics: Learn how
to analyze data over time,
focusing on trends, seasonality,
and simple forecasting
methods.
• Introduction to Deep
Learning: Understand the
basics of neural networks, how
they work, and when to use
them.
Learning Resources
• Time Series models
• Gradient Boosting
• Deep Learning:
◦ 3Blue1Brown Learning Intro
◦ Andrew Ng’s Deep Learning Specialization (af liate link)
• Bonus for ML specialists: Designing Machine Learning Systems
(af liate link)
Timeline
Expect to spend about four more weeks getting familiar with these
concepts. You won’t be an expert by any means, but this is kind of a
survey of what exists right now that you could use as tools in future
projects.
Example Projects
To put your learning into practice, do all of the exercises in the
courses. We’ll turn to self-directed projects soon.
Cloud Platforms
Speaking of things you should be familiar with but de nitely won’t
have time to master, we need to learn a little bit about cloud
platforms.
Cloud platforms like AWS, Google Cloud Platform (GCP), and
Microsoft Azure are essential tools for modern data science. They
allow you to work with large-scale data, deploy machine learning
models, and manage resources ef ciently. While you won’t need to
become an expert in all of them, you do need to know what they do
and how to use key features in your work ow.
AWS is the most popular, so unless you already know you’re
targeting a company that uses GCP or Azure, just focus on AWS. The
concepts are similar with the others and you can always learn them
later.
What to Know
• Storage and Databases: Learn how cloud platforms handle data
storage (e.g., Amazon S3) and how to access data securely.
• Compute Services: Understand how to use cloud resources to run
large-scale computations or train models (e.g., AWS EC2).
• ML Services: Get familiar with cloud-based tools like AWS
SageMaker for training, deploying and managing machine
learning work ows.
Learning Resources
• Fully-Managed Notebook Instances with Amazon SageMaker - a Deep Dive
SageMaker playlist
• AWS SageMaker For ML And DL Tutorial Playlist
Timeline
• Spend one week exploring the basics. Start with free-tier accounts on AWS, GCP,
or Azure to get hands-on experience.
• Dive deeper during projects when you need to deploy models or work with cloud-
based data storage and analytics.
Example Projects
Walk through example tutorials in your own environment to get started. Then, use
cloud resources in your self-directed end-to-end projects (next learning phase!)
Business Logic
Having strong business intuition is also crucial for Data Scientists. It’s
not enough to build models—you need to work on projects and
interpret results in ways that drive business impact. This is usually
tested via case studies, which are designed to test your ability to
think critically about business problems, identify key metrics, and
propose actionable solutions. These skills are essential for success in
interviews, and obviously on the job where business impact is
ultimately the thing that will drive your career forward.
What to Know
1. Understand common causes for changes in metrics, such as
seasonal trends, product changes, or external factors.
2. Break down metrics into components (e.g., customer acquisition
vs. retention) to pinpoint areas of change.
3. Key Metrics for Different Business Types:
3.1. E-commerce: Conversion rates, average order value (AOV),
customer lifetime value (CLV).
3.2. SaaS (Software as a Service): Monthly recurring revenue
(MRR), churn rate, user engagement.
3.3. Marketing: Return on investment (ROI), click-through rate
(CTR), cost per acquisition (CPA).
3.4. Operations: Ef ciency metrics, downtime, capacity utilization.
4. How to Answer Case Studies. We’ll talk about this in the interview
section.
Learning Resources
• YouTube mock interviews
• InterviewQuery has practice questions where you can read
others’ answers.
Timeline
• Spend ½ week understanding the basics of business metrics and
structured problem-solving. Basically, just get your mind thinking
about these kinds of things in general. It will take a lot longer to
get good at business intuition and answering case study
questions.
• Dedicate additional time to practice and preparation before each
interview. Regularly work on mock case studies to re ne your
approach and gain con dence.
LeetCode
The last thing on our roadmap is “LeetCode-style" coding.
If you’re not sure what that means, basically it’s a type of coding
question where you’re given algorithmic challenges to solve. These
challenges test your understanding of data structures (like arrays,
linked lists, and trees), algorithms (such as sorting, searching, and
dynamic programming), and computer science fundamentals.
I personally despise LeetCode for data science interviews, because
in my opinion it’s completely irrelevant for the kind of coding we do
on the job. Data science typically involves wrangling messy data,
building predictive models, and implementing data pipelines — not
designing the most optimal binary search algorithm. But the fact is,
a lot of companies still use it, so you need to know how to pass the
interview.
That being said, for most data science interviews it’ll just be
questions on arrays and strings, so we don’t need to go into software
engineer-level learning on data structures and algorithms.
What I’d suggest learning is:
Also, quick note that data science interviews also often test SQL, so
don’t forget to practice that as well.
Learning Resources
• Big-O Notation
• NeetCode
• Grokking the Coding Interview Patterns in Python
• You can also just walk through the LeetCode arrays questions
and ask ChatGPT to help you understand the “tricks” and optimal
solutions.
Timeline
Dedicate about ½ week to familiarizing yourself with LeetCode and
solving introductory problems. Then, start a habit of practicing one
Python and one SQL question every day until you get a job.
Summary of Skills to Learn
Primary Topic Speci c Skills Time to Learn
Python - Basics: Variables, data types, 4 weeks
loops, conditionals, functions
- Data structures: Lists,
dictionaries, sets, tuples
- File handling, exceptions
- Libraries: Pandas, Numpy,
Scikit-learn
- Package management: pip,
conda
Statistics - Descriptive statistics: Mean, 4 weeks
median, variance, distributions
- Inferential statistics:
Hypothesis testing, p-values,
con dence intervals
- Experimentation: A/B testing,
sample size, randomization
- Probability: Bayes' theorem,
basic concepts
- Multicollinearity, correlation,
causation
SQL - Core querying: SELECT, 2 weeks
WHERE, ORDER BY, LIMIT,
NULL handling
- Joins: INNER, LEFT, RIGHT,
FULL OUTER
- Aggregations: GROUP BY,
COUNT, AVG, SUM
- CASE statements, CTEs,
window functions, subqueries
Summary of Skills to
Learn
Primary Topic Speci c Skills Time to Learn
Data Visualization - Python: Matplotlib, Seaborn, 1 week
Plotly
- Dashboards: Tableau, Power BI
Git - Basic commands: init, add, 0.5 weeks
commit, push, pull, clone
- Branching and merging
- GitHub: Repository setup,
documentation
Math for ML - Vectors, matrices, dot products, 2 weeks
matrix operations
- Derivatives, partial derivatives,
gradients and gradient descent
Machine Learning - Fundamentals: Regression, 8 weeks
classi cation,
supervised/unsupervised learning
- EDA, data cleaning, feature
engineering, train-test split
- Metrics: RMSE, F1, accuracy, recall,
precision
- Fundamental aglorithms:
Regression, decision trees, k-
means
- Regularization, cross-validation,
hyperparameter tuning
- Ensemble models, time series,
deep learning
Summary of Skills to
Learn
Primary Topic Speci c Skills Time to Learn
Cloud Platforms - Storage: e.g. S3 1 week
- Compute: e.g. EC2
- ML Services: e.g.
SageMaker
Business Logic - Metric analysis: 0.5 weeks
KPIs, A/B testing
- Industry-speci c
metrics: SaaS, e-
commerce,
marketing
LeetCode Practice - Arrays, strings, Big- 0.5 weeks and
O, edge cases ongoing
- SQL challenges
Extra Credit
There are some more things that I think will be really valuable to you
in your rst job, but aren’t essential. If you have time and curiosity, I
would also recommend learning about:
• Social Media: Use Twitter, Reddit, or YouTube APIs to analyze trends, sentiment, or
user behavior.
• Economic Data: Analyze public datasets like World Bank or Census data to identify
trends in GDP, unemployment, or urbanization.
• Weather Data: Use NOAA or OpenWeatherMap APIs to analyze patterns and build
forecasting models.
Recommendation Systems
Sentiment Analysis
• Text Analysis: Analyze customer reviews, social media posts, or product feedback
to assess sentiment or extract insights.
• Topic Modeling: Use NLP techniques to identify themes in large text datasets.
Predictive Models
These internal projects not only add to your portfolio but also
demonstrate initiative, problem-solving skills, and an ability to add
value in a business context. Often, success in these projects can lead
to formal recognition, such as transitioning into a more analytical
role within the same company. Just make sure you’re following all
the rules about handling this data.
Leverage Your Network
If you aren’t currently employed or your current job doesn’t provide
opportunities for data science work, there are still plenty of options.
Start with people you already know—friends, family, or
acquaintances—who may own businesses or manage teams. For
instance:
• https://github.com/katiehuangx
• https://github.com/TeneikaAskew
• https://github.com/Erlemar
These all have a clear and engaging introduction, and easily direct the reader to
projects.
If you’ve set up a professional website, make sure it includes:
Regardless of which approach you take, each project should have its own clean,
organized GitHub repository for each project, including:
• A README with the project overview, tools used, and key ndings.
• Well-commented code and modular scripts.
• Instructions for reproducing the project (if applicable).
One thing to highlight in your write-ups is why you made the decisions you did
during the project. For example, why did you choose a speci c model or pre-
processing approach? This will help prepare you for interviews and show that you
understood the concepts well enough to make good decisions.
It’s also useful to write about your project on Medium or LinkedIn to explain your
process and results. You’ll want to focus on storytelling: what problem you solved,
how you solved it, and why it matters. Remember, your communication skills are also
important for data science, so creating an engaging narrative also highlights an
important skill for future employers.
LinkedIn
Next, we’re going to make sure your LinkedIn looks good. We’ve already done some of
this in the mid-learning check-in, but here’s a checklist to make sure everything is
polished and optimized:
• https://www.linkedin.com/in/katiehuangx/
• https://www.linkedin.com/in/teneikaaskew/
Resume
When updating your resume, keep it concise and tailored:
This advice is somewhat speci c to the U.S., but the core ideas
remain the same even if you live in a country that expects longer
CVs.
Let’s look at an example resume.
• It's one page and formatted in
an ATS-friendly way.
• Links are easy to nd.
• The summary has lots of
keywords, and describes the
candidate as already being in
the eld.
• Skills are prominently listed at
the top.
Here you can see: • Since there is a relevant
certi cate (or degree),
Education is near the top as
well.
• Then projects follow, with clear
examples of tools used and
metrics.
• Finally, professional experience
is at the end, where we tried as
much as possible to make it
relevant.
Cover Letters
Now, cover letters: While I don't personally rely on cover letters too
heavily at this point in my career, I think in the early days having a
well-crafted cover letter that shows enthusiasm for this particular
role at this particular company (with details), and transferable skills
can go a long way.
For example, assuming we’re the teacher with the resume above,
and we want to apply for a Junior Data Scientist at a bank:
How to Get Interviews
Now that we have our application materials ready and a strong
portfolio, it’s time to start applying in earnest.
We often hear that applications are just a numbers game. And while
that is true to an extent, we need to be a lot more strategic than just
sending our resume out to every posting we see (and we absolutely
don’t want to be doing this with GenAI tools).
Instead, prioritize jobs where:
1. You have a strong chance, or
2. You would really love the role.
Spend time customizing your resume – or better yet, developing a
networking strategy, which we’ll talk about next – for those roles.
If you still have time after really focusing on the high probability or
high payoff positions, then sure, go ahead and spam your resume a
bit.
What we want to do is avoid the cycle of mindlessly sending out
applications, getting rejected, feeling like there’s no point putting in
the effort, so we’re lazy with the next batch of applications, and so
on. This is a really poor approach strategically, and is bad for your
mental health.
Networking
Even more promising than applications is networking and putting
yourself out there one-on-one. In my data science mentoring, the
people I’m seeing be successful are the ones with very little social
fear. They are super proactive about reaching out to recruiters, hiring
managers, and in-person networking.
I know this sounds painful – trust me, I hate doing it too – but the
fact is that this is absolutely the highest leverage tool we have in our
toolbelt.
If you have little or no industry experience, automated resume
screening systems might lter out your application. Reaching out
directly to a recruiter can bypass this initial screen and give you a
chance to sell yourself as a candidate.
Recruiters receive a lot of messages, so keep yours short, respectful,
and to the point.
Start by brie y introducing yourself, mentioning your relevant skills,
projects, or certi cations. Explain why you’re interested in the
company speci cally, rather than sending a generic message.
For example:
“Hi [Recruiter’s Name],
I’m [Your Name], a Data Scientist with skills in Python, SQL, and machine learning. I
recently completed a project where I built a recommendation system to improve
customer engagement for an e-commerce dataset. I’m particularly interested in
[Company Name] because of your focus on [speci c product, value, or challenge]. I’d
love to connect to learn more about what you look for in data science candidates.
Thank you for your time!”
Recruiters and hiring managers are looking for people who can solve real problems.
Instead of just stating you’re interested in the role, we can even take it a step
further by showing how your skills and projects could add value.
Study the company’s products, recent news, and any challenges they might be
facing. LinkedIn, company websites, press releases, or even industry reports can give
you ideas. In your message, suggest a few ways your data science or ML skills could
address speci c needs. For example, if the company focuses on e-commerce,
mention how your experience with recommendation systems or customer
segmentation projects could enhance their user experience.
Even if you don’t have direct industry experience, reference any relevant projects,
volunteer work, or skills that demonstrate your ability to solve similar problems. For
instance, mention a project where you analyzed customer data or a volunteer role
where you optimized a process – anything that shows how you’d contribute
effectively.
Whether you’re applying to job postings or reaching out to potential contacts, one
super important thing is to remember that you’re not going to get a job by focusing
on how great the job would be for you, but how you’re the best person to solve the
company’s problems. Make sure to clarify what differentiates you from other
applicants, and how these speci c skills are best positioned to solve the speci c
problems outlined in the job description. You need to understand the role, and
communicate clearly how you plan to bring value to the company – not the other
way around.
For example:
Instead of saying: “This role is exciting to me because I’m passionate about machine
learning.”
Say: “I see this role focuses on customer segmentation, and my recent project using
clustering algorithms demonstrated how to drive actionable insights from user data.”
Be consistent about reaching out to potential connections and being thorough with
your applications for high-potential jobs, and at one point you’ll start being called for
interviews.
Interview Prep
Once you get an interview on the calendar, it’s time to start
preparing in earnest.
The key thing that I do differently with interview prep is,
essentially, over-prepare. I put in a ton of work in advance to
anticipate what might be covered, and dig deep into studying those
topics so that it looks like I came up with clever, well thought-out
solutions on the y.
This is particularly important if you lack signi cant industry
experience. Preparation will help you bridge that gap and present
yourself as someone who’s ready to learn and contribute from day
one. You cannot afford to go into an interview unprepared.
Interviews can vary widely depending on the company and the role,
so focus on the common components:
Recruiter Screen
Be prepared to give a brief introduction, summarize your background, and explain
why you’re interested in the role.
At this stage, it’s important to make a strong case for why your skills align with the
job requirements. Use keywords from the job description, reference relevant details
from their blog or website, and ensure your elevator pitch is well-rehearsed.
Review your past projects thoroughly, understanding every detail, including the
reasoning behind your decisions. This preparation ensures you’re ready if the recruiter
asks you to walk through a speci c project. Whenever possible, tailor your examples
and explanations to align with the requirements for the job.
Finally, ask thoughtful questions to gain clarity about the role and understand the
next steps in the interview process. This not only shows your interest but also helps
you prepare effectively for future stages.
Coding Questions
By now, you’ve been practicing coding, so you should be prepared.
Make sure to practice thinking out loud and coding at the same
time.
Remember, it’s not just about getting the correct answer —
interviewers are evaluating how you think, approach problems,
ask clarifying questions, and collaborate. It’s worth taking the time
to practice coding interviews with another person who can provide
feedback on these aspects. This will help you re ne your
communication skills and improve your ability to explain your
thought process effectively.
Behavioral Questions
You can get hints about the kinds of behavioral questions you might
be asked by reviewing the company’s values or leadership
principles. These often guide the questions interviewers ask to
evaluate cultural t and alignment with their priorities. Prepare at
least one example from your past experience that demonstrates
how you embody each of these values.
When answering, use the STARR method (Situation, Task, Action,
Result, Re ection) to structure your responses effectively. This
ensures your answers are clear, concise, and highlight your impact.
Focus on examples that showcase teamwork, problem-solving, and
adaptability while tying them back to the company’s core values.
Case Studies
Case studies test your ability to solve real-world problems using data
science and machine learning. Prepare by researching the business
you’re interviewing with, understanding their industry, and
anticipating relevant problems, metrics, and data types. Dive into
the company’s products, services, and goals to tailor your approach
and demonstrate domain-speci c knowledge.
Business
Context
Be ready to discuss how your
work as a Data Scientist can
impact the company’s bottom
line, decision-making processes,
or strategic goals. This shows that
you not only have technical
expertise but also understand the
business implications of your
work. Tailor your responses to
align with the company’s mission
and priorities.
Prepare
Questions for
Each Phase
For every interview stage, have
thoughtful questions prepared.
These could be about the role, the
team, the company’s challenges,
or their expectations. Asking
questions not only shows your
genuine interest but also helps
you gain insights to better
position yourself as the ideal
candidate.
Preparation is Key, But You Donʼt
Need to Know Everything
The key to success in all of these is to think in advance about the
business to such an extent that you’re able to essentially guess what
the case studies will be, so that you can narrow down the areas to
study.
I have a detailed video that goes over how to prepare for every phase
of the interview process, from the recruiter call to behavioral
questions to case studies. I would de nitely recommend checking
that out!
All of the advice in that video is the same for beginners, except that
you will have more times when you do not have relevant
experience to draw on. In those cases, don’t panic—pivot the
conversation to how you would approach the problem. For example:
“While I haven’t worked on a problem like this before, here’s how I
would break it down…”
This demonstrates critical thinking and a willingness to tackle new
challenges, both of which are valuable traits for any candidate.
It’s also ok to ask clarifying questions of the interviewer and
demonstrate your curiosity and growth mindset. Interviewers don’t
expect you to know everything, but they do want to see how you
think and approach complex problems.
Remember, every interview you attend—even if it doesn’t result in
an offer—is a valuable learning experience. The more you practice,
the more comfortable and con dent you’ll become.
How to Do Well When
You Start Your First
Job
Congratulations—you’ve landed your rst data science role! Starting
a new job can be intimidating, especially in a technical eld like this
one.
Here are some suggestions on how to do well when you start your
rst job:
• Learn the metrics that matter: Find out what KPIs (Key
Performance Indicators) your team or company cares about and
why.
• Understand the challenges: Read up on industry trends, internal
documentation, or past projects to get a sense of the business’s
pain points.
• Talk to stakeholders: Ask stakeholders or team members what
problems they’re trying to solve and how your work can help.
Building this understanding early on will guide your decisions
and make your contributions more impactful.
2.Ask Questions and Seek
Feedback
It’s okay not to know everything—no one expects you to at this
point. What matters is that you’re proactive about learning. Here’s
how to approach this:
• Ask questions: Whether it’s about the company’s processes, tools,
or expectations, don’t hesitate to seek clari cation. This shows
initiative and a desire to get things right.
• Get feedback often: Ask for feedback on your work from peers,
mentors, or managers. The sooner you know what you’re doing
well (and what you need to improve), the faster you’ll grow.
• Pair program with colleagues: This can help you learn coding
practices, tools, and work ows speci c to your company.
• Learn from code reviews: Take reviews seriously and see them as
opportunities to improve your skills.
• Find a mentor: Identify someone on your team who is
experienced and approachable, and learn from their guidance.
Building strong relationships with your colleagues will not only help
you learn faster but also create a supportive environment where you
can thrive.
4. Focus on Delivering Value
In your rst few months, aim to contribute to projects that provide
visible value, even if they’re small. For example:
Connect
ideas and
insights across personal
growth, professional
With Me
development, and the world of
AI and data science.
• If you'd like to chat with me
1:1, you can book a call here:
https://topmate.io/marina_wyss
• ❤ Or, nd me on Instagram:
link