Action PlanJournaling
Action PlanJournaling
Learning Path -
https://learn.365datascience.com/my-learning-path/
How I'd learn ML in 2025 (if I could start over)
YT20STL1KAS
Main Topic 1
https://app.datacamp.com/learn/career-tracks/data-scientist-in-python
Main Topic 2
https://app.datacamp.com/learn/career-tracks/associate-data-scientist-in-python
https://app.datacamp.com/learn/courses/introduction-to-tensorflow-in-python ---Started [Inprogress]
https://app.datacamp.com/learn/courses/introduction-to-statistics-in-python [ TBD]
Explicit tutorial for each these topics need to be done
Pandas[https://app.datacamp.com/learn/courses/data-manipulation-with-pandas]
List comprehension
Class Inheritance
https://app.datacamp.com/learn/courses/end-to-end-machine-learning [Advanced]
----------
Maths Important Topics
Ø How to do derivates and integrals
Ø What Vector and Matrices are and how their basic operations work
Ø The basic concepts behind probability theory
Ø Some basic rules of summation and logarithm
Main Topic 3
https://app.datacamp.com/learn/career-tracks/associate-python-developer
Subtopic
https://app.datacamp.com/learn/courses/introduction-to-python-for-developers -- Completed
https://app.datacamp.com/learn/courses/intermediate-python-for-developers ---
Started[Inprogress]
=================================================
Machine Learning in python - this is more related to the M-tech course
https://app.datacamp.com/learn/courses/supervised-learning-with-scikit-learn
================
https://www.coursera.org/learn/numpy-data-science [Data science with NumPy, Sets, and Dictionaries]
https://www.coursera.org/learn/machine-learning-with-python [ This will give basic machine learning
understanding - Machine learning in python via IBM]
Pandas
NumPy ---> Scientific Calculations
TensorFlow ---> Computational Calculations
Matplotlib ---> Visual representation
=================================================
Week 1: Foundations of Data Science
Day 1-2: Introduction to Data Science
• Learn: What is Data Science? Roles, tools, and applications.
• Resources:
○ Articles: Harvard Data Science Overview.
○ Video: "What is Data Science?" (Kaggle YouTube).
• Task: Write a one-page summary of Data Science.
Day 3-4: Learn Python Basics
• Topics: Variables, data types, loops, conditional statements.
• Resources:
○ Codecademy’s Python Course.
○ "Python for Data Science" playlist on YouTube.
• Task: Write a Python program to calculate the mean and median of a list of numbers.
Day 5-7: Data Manipulation with Pandas
• Topics: DataFrames, filtering, sorting, grouping, missing data handling.
• Resources:
○ Kaggle’s Pandas micro-course.
○ Book: "Python for Data Analysis" by Wes McKinney.
• Task: Analyze a sample dataset (e.g., Titanic dataset) using Pandas.
================================
Knowledge on recommendation system -
https://app.datacamp.com/learn/courses/building-recommendation-engines-in-python
2. Use GitHub to track and showcase your progress.
3. Participate in Kaggle competitions for practical experience.
4. Join Data Science communities for networking and advice.
Let me know if you’d like me to customize or add more resources to this plan!
================================
Knowledge on recommendation system -
https://app.datacamp.com/learn/courses/building-recommendation-engines-in-python
As the name indicates, recommendation systems assist you in predicting the future
preference of any product and recommending the best-suited items to users.
The technique where the system predicts whether an individual or a business likes
the product (a classification problem) or the reviews or ratings by them (a regression
problem) is known as the recommendation prediction problem.
Here, you will understand the procedure for using population averages.
-----------------------------
Ø Collaborative Filtering
Collaborative filtering is an aspect of recommendation systems with which we
interact quite frequently. Upon collecting data on the preferences of multiple users,
collaborative filtering makes predictions for the choice of a particular user.
Here, you will understand the procedure to use collaborative filtering with the help of
similar users.
Here, you will understand the procedure to use collaborative filtering with the help of
similar items.
Here, you will understand the procedure to use collaborative filtering with the help of
similar users and items.
-------------------------------------------
Ø Personalized Recommendations
As suggested by the name itself, personalized recommendations work to filter out
recommendations that are personally relevant for a user, based on their browsing
trends, etc.
Here, you will learn how to utilize personalization recommendations with the help of
comparisons, rankings, and user items.
• Hidden Markov Model / Neural Nets, Bipartite graph, and graphical model
The Hidden Markov Model (HMM) is a statistical Markov model in which the system
being modeled is regarded as a Markov process with hidden/unobserved states.
Here, you will learn how to utilize personalization recommendations with the help of
comparisons, rankings, and user items.
• Hidden Markov Model / Neural Nets, Bipartite graph, and graphical model
The Hidden Markov Model (HMM) is a statistical Markov model in which the system
being modeled is regarded as a Markov process with hidden/unobserved states.
This chapter will familiarize you with the procedure to use side information with the
assistance of Meta-Prod2Vec.
This chapter will familiarize you with the procedure to make a system considering
algorithmic and system challenges.
----------------------------------------------
MIT 12 weeks DatacScience program for industry adoption
https://idss-gl.mit.edu/dsml-program-preview?
enc_e_lid=W3MRxJJU14xooTB6Qy0FM9wrOIO9Nh14XiFBxPaXCD%
2Blu8iESCXaChC7Tgbl07eUBci7QGHwHV30TV60%2B8aOa1PUyKc%2B5qjSMnGb%2FY%2Fb%
2BGuB156k85zIA1%2Fgwa8lJlDVh%2FrX1twtFmAGe4%2FAwBFY2mcXvaU2sMWA0io%3D--
pZEXEPDNdVspASCo--%2BwBolkyIkFfGhYtNCpom8w%3D%3D
Python, for Data Scientists and Machine Learning specialists, is a lingua franca owing to
the immense promise of this widely-used programming language. To strengthen your
Python foundations, this module focuses on NumPy, Pandas, and Data Visualization.
• Numpy
Numpy is a Python package for scientific computing that enables one to work with multi-
dimensional arrays and matrices.
• Pandas
Pandas is an open-source and powerful library in Python that is used to analyze and
manipulate data.
• Data Visualization
Data Visualization means dealing with the graphic representation of data, which
effectively generates insights from data by using matplotlib, seaborn, etc., libraries.
===========================
• Descriptive Statistics
• Inferential Statistics
It will explore the areas of distributions and parameter estimation, ultimately allowing you
to make inferences from the data.
===================
Here, you will understand the procedure to implement linear regression with one and
several variables.
This chapter will familiarize you with the procedure to implement linear regression for
predictive analysis.
Here, you will understand the procedure to implement linear regression with one and several
variables.
This chapter will familiarize you with the procedure to implement linear regression for predictive
analysis.
This chapter will familiarize you with the procedure to implement linear regression for causal
inference.
Logistic regression is a simple classification algorithm in Machine Learning that predicts the
categorical dependent variables using independent variables.
This chapter will familiarize you with all the fundamentals of Logistic Regression and other types
of nonlinear regression in Machine Learning.
-----------------------------
This chapter will teach you the process of making good predictions with high-dimensional data.
Overfitting occurs when a model over-trains the data. In Layman's terms, suppose a model
learns the detail and noise within the training data. In that case, the training data will negatively
affect the performance of the model on new data.
This chapter will teach you the process of avoiding overfitting through validation and cross-
validation techniques.
Here, you will understand regularization by Lasso, Ridge, and their modification.
Regression Trees are built using binary recursive partitioning, an iterative process that splits the
data into partitions or branches. It later splits each portion into smaller groups as the process
advances every branch.
-------------------------
This chapter will teach you the process of identifying and working with Randomized Control
Trials.
---------------
experiments. Yet, it can easily pass unrecognized, although its recognition is essential for
significantly interpreting causal relationships, like evaluating treatment effects.
---------------