0% found this document useful (0 votes)
128 views4 pages

5-Week Data Science Bootcamp Detailed Syllabus

The 5-week Data Science Bootcamp offers free online training with a focus on building data culture through structured learning modules and practical challenges. Participants will learn essential data science skills, including Python programming, data analysis, machine learning, and real-world applications. The bootcamp includes live mentorship sessions and utilizes Discord for real-time communication and support.

Uploaded by

John Dale Vacaro
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
128 views4 pages

5-Week Data Science Bootcamp Detailed Syllabus

The 5-week Data Science Bootcamp offers free online training with a focus on building data culture through structured learning modules and practical challenges. Participants will learn essential data science skills, including Python programming, data analysis, machine learning, and real-world applications. The bootcamp includes live mentorship sessions and utilizes Discord for real-time communication and support.

Uploaded by

John Dale Vacaro
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

5-Week Data Science Bootcamp

DETAILED SYLLABUS

Overview

In our endeavour to build data culture and democratize Data Science learning, we are

launching a 5-week Data Science Bootcamp with the help of resources contributed by

academia and industry experts. The online bootcamp will have a series of day-wise learning

modules along with intuitive practice quizzes/challenges.

This is a community initiative, driven by experts and mentors, and you have the

opportunity to attend it for free.

Prerequisites

● Nil, anyone with a passion for learning can make it to the finish line :)

Format

Tutors will provide learners with guided learning paths, resources and exercises to solve. The
entire schedule, practical details, registration details will be put up very soon. A brief summary
of the format can be found below:

● Day-wise modules: Trainers will post day-wise challenges and learning modules (mostly
some of the best-curated content available on the internet that would allow you to have
a structured learning path)

1 dphi.tech <Democratizing Data Science Learning>


● For real-time communication, we will be using Discord. This medium will help learners
to clear doubts on a real-time basis if they are stuck somewhere. In addition, this will
also allow learners to interact with the mentors and fellow learners
● Live doubt clearing and mentorship sessions will be organized every week based on the
requirements of the learners

Schedule

Week #0 - Python Crash Course and Intro to Data Science (Optional)

● Intro to Data Science - its prominence and use-cases


● Environment setup - python installation - anaconda ide
● Python for Data Science
○ Basics of Python
■ Print a string "Hello World"
■ Python basic syntax
■ Data structures and types
○ Python Lists & Strings
○ Intro to Functions
○ Brief Intro to Python Libraries for Data Science - Numpy and Pandas

Week #1 - Data Analysis and Data Visualization (Release on: 11th March)

● Dive Deep into Numpy and Pandas libraries


● Python Web Scraping
● Exploratory Data Analysis
● Intro to Data Visualization
● Graded Quiz 1 - 18th March

Week #2 - Advanced Exploratory Analysis and Data Pre-Processing (Release on:


18th March)
(Data Cleaning, Outlier detection etc.)

● Basic Statistics
● Charts and Visualization

2 dphi.tech <Democratizing Data Science Learning>


● Outlier Analysis
● Handling Missing Values
● Handling Imbalanced datasets, Oversampling - SMOTE
● Standardization/Normalization of data - what, why and when?
● Graded Quiz 2

Week #3 - Feature Selection and Building ML Models (Release on: 25th March)

● Intro to feature extraction and feature selection - explain how they are different
● Elaborate more on Feature Extraction
● Feature selection and its importance
○ Various feature selection/engineering techniques
○ Boruta
● Building efficient and effective models
● Splitting data into test and train datasets
● ML Algorithms:
○ Linear Regression
○ Logistic Regression
○ Cost function & Gradient Descent
● Overfitting & Underfitting

Week #4 - Model tuning and ML Algorithms (Release on: 1st April)

● Other ML Algorithms
○ Tree-based models
■ Decision trees
■ Random forest
■ A brief intro to other boosting and bagging techniques/algorithms
● Model tuning
○ Hyperparameter tuning
○ Evaluation Metrics (Model evaluation)
● Project - solve real-world data science problem on Ed-tech and Fintech.
● Graded Assignment (Released around 8th April)

Week #5 - Applied Data Science & ML - Problem-solving (Release on: 8th April)

● HR Analytics problem - predicting employee churn


● Ed-tech customer analysis - predicting user churn

3 dphi.tech <Democratizing Data Science Learning>


● Fraud analytics - predicting fraud detection
● Anti-money laundering analytics - predicting money laundering cases in transactions
data
● Real-estate price analysis - problem
● Getting started with Data Science competitions - Kaggle

4 dphi.tech <Democratizing Data Science Learning>

You might also like