Becoming a Data Scientist: Study Plan
1. Mathematics
○ Linear Algebra (15 hours)
■ Study concepts like vectors, matrices, eigenvalues, and
eigenvectors.
■ Resources: Khan Academy's Linear Algebra course
○ Calculus (15 hours)
■ Learn about limits, differentiation, and integration.
■ Resources: Khan Academy's Calculus courses
○ Probability and Statistics (15 hours)
■ Study probability theory, random variables,
distributions, and basic statistics.
■ Resources: Khan Academy's Probability and Statistics
courses
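
To make the three mathematics topics concrete, here is a minimal sketch that uses only the Python standard library. The matrix, the function, and the sample values are made up for illustration, and the helper names `mat_vec` and `derivative` are not part of any library.

```python
import statistics


def mat_vec(A, v):
    """Multiply a matrix (a list of rows) by a vector."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def derivative(f, x, h=1e-6):
    """Approximate f'(x) with a central difference."""
    return (f(x + h) - f(x - h)) / (2 * h)


# Linear algebra: v is an eigenvector of A when A v is a multiple of v.
A = [[2, 1], [1, 2]]
v = [1, 1]
Av = mat_vec(A, v)  # [3, 3], so the eigenvalue is 3

# Calculus: the derivative of x^2 at x = 3 is 2 * 3 = 6.
slope = derivative(lambda x: x ** 2, 3.0)

# Statistics: mean and population standard deviation of a small sample.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
mean = statistics.mean(sample)      # 5
spread = statistics.pstdev(sample)  # 2.0

print(Av, round(slope, 4), mean, spread)
```

Writing the operations by hand first is one way to check your understanding before moving to NumPy later in the plan.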
2. Programming
○ Python (60 hours)
■ Syntax and Basic Concepts (10 hours)
■ Data Structures (15 hours)
■ Control Structures (10 hours)
■ Functions (10 hours)
■ Object-Oriented Programming (15 hours)
■ Resources: Python.org's official tutorial
○ R (optional) (20 hours)
■ If you choose R as well, allocate time for syntax and
basic concepts.
■ Resources: "R for Data Science" by Hadley Wickham and
Garrett Grolemund
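
The Python subtopics above (data structures, control structures, functions, object-oriented programming) can all appear in one small program. The sketch below is only an illustration: the `Topic` and `StudyPlan` classes are hypothetical names, and the hours are the ones listed in this plan.

```python
from dataclasses import dataclass, field


@dataclass
class Topic:
    name: str
    hours: int


@dataclass
class StudyPlan:
    # Data structure: a list of Topic objects.
    topics: list = field(default_factory=list)

    def add(self, name: str, hours: int) -> None:
        # Control structure: reject invalid input early.
        if hours <= 0:
            raise ValueError("hours must be positive")
        self.topics.append(Topic(name, hours))

    def total_hours(self) -> int:
        return sum(topic.hours for topic in self.topics)

    def longest(self) -> list:
        """Return the names of the topics with the most hours."""
        top = max(topic.hours for topic in self.topics)
        return [topic.name for topic in self.topics if topic.hours == top]


python_plan = StudyPlan()
for name, hours in [
    ("Syntax and Basic Concepts", 10),
    ("Data Structures", 15),
    ("Control Structures", 10),
    ("Functions", 10),
    ("Object-Oriented Programming", 15),
]:
    python_plan.add(name, hours)

print(python_plan.total_hours())  # 60
print(python_plan.longest())  # ['Data Structures', 'Object-Oriented Programming']
```

The total matches the 60 hours allocated to Python in this section.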
3. Data Manipulation
○ NumPy (Python) (20 hours)
■ Learn how to work with arrays and matrices.
■ Resources: NumPy documentation
○ Pandas (Python) (30 hours)
■ Study data structures like Series and DataFrames for
data manipulation.
■ Resources: "Python for Data Analysis" by Wes McKinney
○ dplyr (R) (20 hours)
■ If you chose R, learn data manipulation using dplyr.
■ Resources: DataCamp's "Introduction to the Tidyverse"
course
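
As a first taste of the Python libraries in this section, the following sketch builds a NumPy array and a pandas DataFrame from the hours listed above. It assumes `numpy` and `pandas` are installed; the column names `library`, `language`, and `hours` are chosen for illustration.

```python
import numpy as np
import pandas as pd

# NumPy: arithmetic applies to the whole array at once.
hours = np.array([20, 30, 20])
share = hours / hours.sum()  # fraction of the section spent on each library

# pandas: a DataFrame is a table of labelled columns.
df = pd.DataFrame(
    {
        "library": ["NumPy", "pandas", "dplyr"],
        "language": ["Python", "Python", "R"],
        "hours": hours,
    }
)

# Filter rows with a boolean mask, then aggregate by group.
python_only = df[df["language"] == "Python"]
by_language = df.groupby("language")["hours"].sum()

print(share.round(3))
print(python_only)
print(by_language)
```

Filtering and grouping are the same kinds of operations that dplyr provides in R through `filter()` and `group_by()` with `summarise()`.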
4. Data Visualization
○ Matplotlib (Python) (20 hours)
■ Start with basic plotting techniques.
■ Resources: Matplotlib documentation
○ Seaborn (Python) (20 hours)
■ Explore more advanced and aesthetic visualizations.
■ Resources: Seaborn documentation
○ ggplot2 (R) (20 hours)
■ If you chose R, learn data visualization using ggplot2.
■ Resources: "Data Visualization with ggplot2" by Hadley
Wickham
○ Interactive Visualization Tools (10 hours)
■ Explore libraries like Plotly and Bokeh for interactive
visualizations.
■ Resources: Plotly documentation
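
A basic Matplotlib plot is a reasonable first exercise for this section. The sketch below charts the hours listed above as a bar chart. It assumes `matplotlib` is installed, and the output filename is arbitrary.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so no display is required
import matplotlib.pyplot as plt

tools = ["Matplotlib", "Seaborn", "ggplot2", "Interactive tools"]
hours = [20, 20, 20, 10]

# One figure with a single set of axes.
fig, ax = plt.subplots(figsize=(6, 3))
bars = ax.bar(tools, hours)

# Label the axes so the chart can be read on its own.
ax.set_xlabel("Tool")
ax.set_ylabel("Planned hours")
ax.set_title("Data visualization study hours")

fig.tight_layout()
fig.savefig("visualization_hours.png")
```

Seaborn builds on Matplotlib, so the figure and axes objects used here carry over when you move on to it.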