0% found this document useful (0 votes)

9 views

Data science curriculum

The document provides a comprehensive overview of data science, covering its relevance in society, various disciplines such as business analytics, machine learning, and artificial intelligence, as well as common techniques and tools used in the field. It also includes detailed sections on SQL databases, statistics, version control with Git and GitHub, Power BI, Python programming, data visualization libraries like Matplotlib and Seaborn, and machine learning concepts. Each section is structured to guide learners through foundational knowledge to advanced applications in data science.

Uploaded by

muheezadedejiokunade

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

Data science curriculum

Uploaded by

muheezadedejiokunade

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

FOUNDATION

INTRODUCTION TO DATA SCIENCE

- Where data science fits into today’s society.

- Why are there so many business and data science buzzwords?
- Analysis vs Analytics
- Intro to Business Analytics, Data Analytics, and Data Science
- Adding Business Intelligence (BI), Machine Learning (ML), and Artificial
Intelligence (AI) to the picture
- The Relationship Between Different Data Science Field
- When are Traditional data, Big Data, BI, Traditional Data Science and ML applied?
- What Is The Purpose Of Each Data Science Field
- Why do we Need each of these Disciplines?
- Common Data Science Techniques
- Traditional Data: Techniques
- Traditional Data: Real-life Examples
- Big Data: Techniques
- Big Data: Real-life Examples
- Business Intelligence (BI): Techniques
- Business Intelligence (BI): Real-life Examples
- Traditional Methods: Techniques
- Traditional Methods: Real-life Examples
- Machine Learning (ML): Techniques
- Machine Learning (ML): Types of Machine Learning
- Machine Learning (ML): Real-life Examples
- Common Data Science Tools
- Programming Languages & Software Employed in Data Science - All the Tools You
Need
- Data Science Job Positions: What Do They Involve And What To Look Out For?
- Data Science Job Positions: What do they Involve and What to Look out for?
- Dispelling Common Misconceptions

SQL AND DATABASES FOR DATA SCIENCE
Getting Started & Installation
What Is A Database?
SQL vs. MySQL
Installation
Creating Databases & Tables
Showing Databases
Creating Databases
Dropping and Using Databases
Introducing Tables
Data Types: The Basics
Creating Tables
How Do We Know It Worked?
Dropping Tables
Tables Basics Activity
MySQL Comments
Inserting Data
INSERT: The Basics
A Quick Preview of SELECT
Multi-inserts
Working With NOT NULL
Sidenote: Quotes In MySQL
Adding DEFAULT Values
Introducing Primary Keys
Working With AUTO_INCREMENT

CRUD Basics
Introducing CRUD
Getting Our New "Dataset"
Officially Introducing SELECT
The WHERE clause
Aliases
Using UPDATE
A Quick Rule Of Thumb
Introducing DELETE
String Functions
The World Of String Functions
Loading Our Books Data
CONCAT
SUBSTRING
Combining String Functions
Sidenote: SQL Formatting
REPLACE
REVERSE
CHAR_LENGTH
UPPER & LOWER
Other String Functions
Refining Selections
Adding Some New Books
DISTINCT
ORDER BY
More On ORDER BY
LIMIT
LIKE
Escaping Wildcards
Aggregate Functions
Count Basics
GROUP BY
MIN and MAX Basics
Subqueries
Grouping By Multiple Columns
MIN and MAX With GROUP BY
SUM
AVG
Aggregate Functions Docs
Revisiting Data Types
Surveying Other Data Types
CHAR vs. VARCHAR
INT, TINYINT, BIGINT, etc.
DECIMAL
FLOAT & DOUBLE
DATE and TIME
Working With Dates
CURDATE, CURTIME, & NOW
Date Functions
Time Functions
Formatting Dates
Date Maths
TIMESTAMPS
DEFAULT & ON UPDATE TIMESTAMPS
Comparison & Logical Operators
Not Equal
NOT LIKE
Greater Than
Less Than Or Equal To
Logical AND
Logical OR
Between
Comparing Dates
The IN Operator
CASE
IS NULL
Constraints & ALTER TABLE
UNIQUE Constraint
CHECK Constraints
Named Constraints
Multiple Column Constraints
ALTER TABLE: Adding Columns
ALTER TABLE: Dropping Columns
ALTER TABLE: Renaming
ALTER TABLE: Modifying Columns
ALTER TABLE: Constraints
One to Many & Joins
Data is Messy
Relationships Basics
One to Many Relationship
Working with FOREIGN KEY
Cross Joins
Inner Joins
Inner Joins With Group By
Left Join
Left Join With Group By
Right Join
On Delete Cascade
Many to Many
Many to Many Basics
Creating Our Many To Many Tables
TV Series Challenge #1
TV Series Challenge #2
TV Series Challenge #3
TV Series Challenge #4
TV Series Challenge #5
TV Series Challenge #6
TV Series Challenge #7
Views, Modes, & More!13 lectures • 49min
Introducing Views
Updateable Views
Replacing/Altering Views
HAVING clause
WITH ROLLUP
SQL Modes Basics
STRICT_TRANS_TABLES
Slicer
Synchronising slicers to multiple pages
Slicer Warning
Adding more control to your visualisations - Filters and slicers
Sort visuals
Configure small multiples
Use Bookmarks for reports
Group and layer visuals by using the Selection pane
Adding more control to your visualisations
Drillthrough
Buttons and Actions
Page Navigation and Drill through actions
Enable Natural Language Queries (Ask A Question) and Page Formatting
Tooltip Pages
Page and Bookmark Navigator
Adding more control to your visualisations - Part
STATISTICS FOR DATA SCIENCE
- Introduction to Statistical Research Methods
- Data Visualization
- Measures of Central Tendency
- Variability
- Standardisation
- Normal Distribution
- Sampling Distributions
- Estimation
- Hypothesis Testing
- t-Tests
- One-way Analysis of Variance (ANOVA)
- Two-way Analysis of Variance (ANOVA)
- Correlation
- Regression
- Chi-Squared Tests

VERSION CONTROL - GIT AND GITHUB

- The Terminal
- Install Git Bash on Windows
- Introduction to Version Control and Git
- Version Control using Git and the Command Line
- Github and Remote Repositories
- Gitignore
- Cloning
- Branching and Merging
- Forking and Pull Requests
- Setting Up Comet

Power BI

1. Getting Started with Power BI:

- Understanding Power BI Desktop, Power BI Service, and Power BI Mobile

- Importing data from various sources (Excel, CSV, SQL Server, Web, etc.)

- Basic navigation and interface of Power BI Desktop

2. Data Preparation:

- Data cleaning and transformation using Power Query Editor

- Merging and appending queries

- Data types and error handling

3. Data Modeling:

- Creating relationships between tables

- Understanding and using star and snowflake schemas

- Managing relationships (one-to-one, one-to-many, many-to-many)

- Using calculated columns and tables

4. DAX (Data Analysis Expressions):

- Basics of DAX syntax and functions

- Creating calculated columns and measures

- Understanding row context and filter context

- Common DAX functions (SUM, COUNT, AVERAGE, MIN, MAX)

- Time intelligence functions (DATEADD, DATESYTD, SAMEPERIODLASTYEAR)

- Advanced DAX functions (CALCULATE, ALL, FILTER, RELATED)

5. Visualization:

- Creating and customizing basic charts (bar, line, pie, scatter, etc.)

- Using slicers for filtering data

- Creating and customizing tables and matrices

- Using maps and geographical data visualizations

- Custom visualizations from the marketplace

6. Advanced Visualization:

- Using bookmarks and selections for interactive reports

- Creating drill-through and drill-down reports

- Using tooltips for enhanced data presentation

- Implementing conditional formatting

7. Reports and Dashboards:

- Designing report layouts and themes

- Creating and managing dashboards in Power BI Service

- Pinning visuals to dashboards

- Using Q&A feature for natural language queries

8. Power BI Service:

- Publishing reports to Power BI Service

- Understanding workspaces, apps, and content packs

- Managing datasets and data refresh schedules

- Sharing reports and dashboards with stakeholders

- Collaborating with team members

9. Power BI Embedded:

- Integrating Power BI reports into applications

- Using Power BI REST API for automation

10. Security:

- Implementing row-level security (RLS)

- Managing roles and permissions

- Understanding and applying data protection and compliance measures

11. Performance Optimization:

- Optimizing data models for performance

- Using Performance Analyzer tool

- Best practices for efficient report design

12. Advanced Analytics:

- Using AI visuals (Key Influencers, Decomposition Tree, Q&A Visual)

- Integrating R and Python scripts in Power BI

- Implementing what-if parameters for scenario analysis

13. Power BI Integration:

- Connecting Power BI with other Microsoft services (Excel, Azure, SQL Server)

- Integrating with third-party tools and data sources

- Using Power Automate for workflow automation

14. Power BI Administration:

- Managing Power BI gateway for on-premises data sources

- Monitoring usage and performance

- Implementing governance and best practices for organisation-wide usage

15. Power BI Community and Resources:

- Participating in Power BI community forums and events

- Utilising Power BI documentation and learning resources

- Staying updated with new features and updates

PYTHON FOR DATA SCIENCE

Why Python Programming
- Introduction to Python and its popularity
- Python's use in various domains (Web development, Data science, Automation, etc.)
- Advantages of Python over other programming languages
- Python community and resources
Data Types and Operators
- Variables and data types (integers, floats, strings, booleans)
- Type conversion and casting
- Basic operators (arithmetic, comparison, logical)
- String manipulation and formatting
- Working with variables and constants
Data Structures in Python
- Lists: creation, indexing, slicing, and manipulation
- Tuples: immutability and use cases
- Dictionaries: key-value pairs and dictionary methods
- Sets: unique elements and set operations
- Lists vs. Tuples vs. Dictionaries vs. Sets
Control Flow
- Conditional statements (if, elif, else)
- Loops (for and while loops)
- Loop control statements (break, continue)
- Using loops for iteration and pattern printing
- Exception handling (try, except, finally)

Functions
- Defining and calling functions
- Parameters and arguments
- Return statements and function documentation (docstrings)
- Scope and lifetime of variables
- Lambda functions and built-in functions

Scripting:
- Reading and writing files
- Command-line arguments (sys.argv)
- Creating and running Python scripts
- Understanding shebang (#!/usr/bin/env python)
- Organising code into modules and packages
-
NUMPY FOR DATA SCIENCE
- Introduction to NumPy and its importance in data science
- Creating NumPy arrays
- Array indexing and slicing
- Array manipulation and broadcasting
- Mathematical operations with NumPy arrays
- Loading and saving data using NumPy
PANDAS FOR DATA WRANGLING
- Introduction to Pandas for data manipulation and analysis
- Series and DataFrame objects
- Loading data into Pandas
- Data exploration and basic statistics
- Data cleaning and handling missing values
- Data filtering, selection, and sorting
- Data visualisation with Pandas
- What is data wrangling and why is it important?
- Data acquisition methods (reading from files, web scraping, APIs)
- Data cleaning techniques (handling missing values, dealing with duplicates)
- Data transformation (reshaping data, merging and joining datasets)
- Data aggregation and grouping
- Data normalisation and scaling
- Dealing with outliers
- Handling categorical data (encoding and one-hot encoding)
- Date and time data manipulation
- Introduction to data quality and validation
- Advanced Pandas techniques for data manipulation (pivot tables, melt, stack, unstack)
- Combining and merging DataFrames (concatenation, merging on keys)
- Data filtering and selection (loc, iloc)
- Using Pandas functions to clean and transform data
- Handling missing data with Pandas
- Applying custom functions to data using Pandas

MATPLOTLIB
- Introduction to Matplotlib and its role in data visualisation
- Basic plotting with Matplotlib (line plots, scatter plots, bar charts)
- Customising plots (labels, titles, legends)
- Subplots and figure customization
- Advanced plotting techniques (histograms, box plots, heatmaps)
- Saving and exporting plots in different formats
SEABORN
- Introduction to Seaborn and its advantages over Matplotlib
- Seaborn's aesthetics and built-in themes
- Creating statistical visualisations (distribution plots, categorical plots)
- Visualising relationships (scatter plots, pair plots, heatmaps)
- Advanced customization and styling in Seaborn
- Combining Seaborn with Pandas DataFrames for effective data exploration

VISUALIZATION
- Univariate Exploration of Data
- In this lesson, you will see how you can use matplotlib and seaborn to produce
informative visualisations of single variables.
- Bivariate Exploration of Data
- Multivariate Exploration of Data
- Explanatory Visualisations

MACHINE LEARNING
ADVANCED REGRESSION
- Introduction To Machine Learning
- Predictive Modelling And Classification
- Assessing Accuracy And The Train-Test Split
- Statistical Learning
- Linear Models
- Least Squares Regression
- Splitting Datasets
- The Train/Test Split
- Multiple Linear Regression
- Multiple Linear Regression
- Variables And Variable Selection
- Feature Engineering
- Saving And Restoring Models
- Regularisation - Data Scaling
- Regularisation : Ridge Regression
- Regularisation : LASSO Regression
- Decision Trees
- Bias-Variance Tradeoff
- Parametric Methods, Ensembling And Bootstrapping
- Random Forests
ADVANCED CLASSIFICATION
- Advanced Classification
- Natural Language Processing
- How Machines Understand Language
- Logistic Regression
- Intro To Binary Classification Using Logistic Regression
- Classification Metrics
- Model Improvements
- Improving Classification Models
- Dealing With Imbalanced Data
- Tree-Based Classification Methods
- Training A Decision Tree
- Tree-Based Methods For Classification
- Support Vector Classification
- Support Vector Machines
- Nearest Neighbours And Naive Bayes
- KNNs And Naive Bayes
- Hyperparameter Tuning & Model Validation
- Hyperparameters And Model Validation
- Neural Network Classifiers
- Classifier Model Selection
- Build All The Classifiers

UNSUPERVISED LEARNING
- Principal Component Analysis
- Advanced Dimensionality Reduction
- Advanced Dimensionality Reduction Techniques
- K-Means Clustering
- Hierarchical Clustering
- Gaussian Mixture Models
- Clustering And Geospatial Analysis
- Recommender Systems
Introduction to Streamlit

○ What is Streamlit?
○ Installing Streamlit
○ Basic Streamlit Concepts: Widgets, Layouts, and State Management
○ Running and Sharing Streamlit Apps

Streamlit Components and Layouts

○ Advanced Layouts and Widgets

○ Creating Interactive User Interfaces
○ Integrating Plotly, Matplotlib, and Altair with Streamlit

Introduction to Big Data

○ What is Big Data?

○ Characteristics of Big Data (Volume, Velocity, Variety, Veracity)
○ Overview of Big Data Technologies (Hadoop, Spark, NoSQL)
○ Data Storage: HDFS, Cloud Storage

Data Wrangling with PySpark

● Topics Covered:
○ Introduction to Apache Spark
○ Working with PySpark DataFrames
○ Data Cleaning and Transformation with PySpark

Data Visualization for Big Data

● Topics Covered:
○ Visualisation Techniques for Large Datasets
○ Aggregation and Filtering in PySpark
○ Integrating PySpark with Streamlit for Real-Time Visualisations

Connecting Streamlit with Big Data Storage

● Topics Covered:
○ Connecting Streamlit to Cloud Storage (AWS S3, Google Cloud Storage)
○ Streaming Data into Streamlit from Big Data Sources
○ Real-time Data Processing with Kafka and Streamlit

Machine Learning on Big Data

● Topics Covered:
○ Introduction to Machine Learning on Big Data
○ Using MLlib with PySpark
○ Integrating Machine Learning Models in Streamlit

Advanced Streamlit Features

○ Custom Components in Streamlit

○ Deploying Streamlit Apps on Heroku, AWS, and Google Cloud
○ Streamlit Authentication and Security

Big Data Project Development

● Topics Covered:
○ Project Planning and Management
○ Integrating All Components: Data Ingestion, Processing, Visualization, and
Machine Learning
○ Optimising Streamlit Apps for Performance
Projects
The Blackjack Capstone Project
Higher Lower Game
Data Analysis of a CSV File
Obtain a dataset in CSV format (e.g., from Kaggle or other open datasets).
Use Pandas to load and clean the data.
Perform exploratory data analysis (EDA) using Pandas and NumPy to answer
questions and visualise patterns in the data.
Generate summary statistics, histograms, and other visualisations to gain insights
from the dataset.

Stock Portfolio Analysis

Retrieve historical stock price data using Pandas' data reader or API.
Create a Pandas DataFrame to store and manipulate the data.
Calculate and visualise portfolio statistics, such as returns, volatility, and risk-adjusted
performance.
Implement simple portfolio optimization strategies, such as the Markowitz Efficient
Frontier.

Customer Segmentation
Obtain a customer dataset (e.g., retail sales data or online store data).
Use Pandas to preprocess and clean the dataset.
Utilise NumPy for clustering algorithms like k-means to segment customers based on
their purchase behaviour.
Visualise customer segments and analyse their characteristics.

Time Series Forecasting

Collect a time series dataset (e.g., stock prices, weather data).
Load and manipulate the data with Pandas.
Use NumPy and Pandas to implement time series forecasting models like moving
averages, exponential smoothing, or ARIMA.
Visualise the time series data and the forecasted values.
Movie Recommender System
Acquire a movie ratings dataset (e.g., MovieLens dataset).
Clean and preprocess the data using Pandas.
Implement a basic movie recommender system using NumPy and Pandas, based on
user ratings and movie metadata.
Provide movie recommendations for a given user.

E-commerce Sales Analysis

Collect e-commerce sales data, including customer transactions and product
information.
Use Pandas for data cleaning and merging datasets.
Analyse sales trends, customer behaviour, and product performance using Pandas and
NumPy.
Create visualisations and reports to summarise the findings.

Data Cleaning and Transformation Tool

Develop a tool that allows users to upload messy datasets.
Use Pandas to clean and transform the data, addressing common data quality issues
like missing values, duplicates, and inconsistent formatting.
Provide options for data export in different formats (e.g., CSV, Excel) after cleaning.

House Price Prediction:

Utilise a dataset of housing prices, including features like square footage, number of
bedrooms, and location.
Build regression models (linear regression, decision tree regression, or random forest
regression) to predict house prices.
Evaluate and compare model performance using metrics like Mean Absolute Error
(MAE) and Root Mean Squared Error (RMSE).
Energy Consumption Forecasting
Gather time-series data on energy consumption along with weather-related features.
Develop a time series forecasting model (e.g., ARIMA, LSTM) to predict future
energy consumption.
Assess the accuracy of the model's predictions.
Stock Price Prediction
Collect historical stock price data for a specific company or stock market index.
Implement a time series regression model to predict future stock prices.
Evaluate the model's performance using metrics like Mean Squared Error (MSE) and
visualise the predictions.

Customer Churn Prediction

Work with customer data from a business (telecom, subscription service, etc.).
Create a classification model (logistic regression, random forest, or support vector
machine) to predict customer churn.
Evaluate the model's accuracy, precision, recall, and F1-score.

Sentiment Analysis on Social Media

Collect social media data (e.g., tweets or reviews) related to a product or topic of
interest.
Build a text classification model using techniques like natural language processing
(NLP) and sentiment analysis.
Analyse sentiment trends and sentiment distribution.

Image Classification (e.g., MNIST, CIFAR-10)

Use popular image datasets like MNIST or CIFAR-10.
Create a convolutional neural network (CNN) for image classification tasks.
Visualise the model's performance and make predictions on new images.

Customer Segmentation
Apply clustering algorithms like k-means or hierarchical clustering to segment
customers based on their purchasing behaviour.
Analyse customer segments and develop targeted marketing strategies.
Anomaly Detection in Network Traffic
Work with network traffic data and focus on anomaly detection.
Implement unsupervised learning techniques (e.g., isolation forests or autoencoders)
to identify unusual patterns or attacks in network traffic.
Topic Modeling for Text Data
Use a dataset of text documents (e.g., news articles, research papers).
Apply topic modelling techniques like Latent Dirichlet Allocation (LDA) to discover
underlying topics within the documents.

Market Basket Analysis

Work with transaction data from a retail store.
Use association rule mining (e.g., Apriori algorithm) to identify patterns in customer
purchasing behaviour.
Suggest product recommendations based on frequent itemsets.

Introduction to Streamlit

● Practical Exercise:
○ Build a basic Streamlit app that displays text, images, and charts.
● Assignment:
○ Create a simple dashboard with user inputs (e.g., sliders, checkboxes).

Streamlit Components and Layouts

● Practical Exercise:
○ Develop a Streamlit app with complex layouts and multiple interactive charts.
● Assignment:
○ Design a multi-page Streamlit app.

Introduction to Big Data

● Practical Exercise:
○ Explore a small dataset using traditional methods.
● Assignment:
○ Write a brief report on the challenges and opportunities of Big Data.
Data Wrangling with PySpark

● Practical Exercise:
○ Process a medium-sized dataset using PySpark.
● Assignment:
○ Clean and transform a dataset using PySpark and load it into a Streamlit app.

Data Visualization for Big Data

● Practical Exercise:
○ Visualize a large dataset in Streamlit using PySpark.
● Assignment:
○ Build a data dashboard in Streamlit that visualizes trends in a large dataset.

Connecting Streamlit with Big Data Storage

● Practical Exercise:
○ Set up a connection between Streamlit and a cloud storage service.
● Assignment:
○ Create a Streamlit app that pulls data from a cloud storage service and
visualizes it.

Machine Learning on Big Data

● Practical Exercise:
○ Build and deploy a machine learning model using PySpark and Streamlit.
● Assignment:
○ Develop a Streamlit app that allows users to train and test a machine learning
model on large datasets.

Advanced Streamlit Features

● Practical Exercise:
○ Create and deploy a Streamlit app with custom components.
● Assignment:
○ Secure a Streamlit app and deploy it to a cloud platform.

Big Data Project Development

● Practical Exercise:
○ Start working on a capstone project that integrates Streamlit and Big Data.
● Assignment:
○ Submit a project proposal outlining the scope, objectives, and technologies
used.

Capstone Project Presentation

● Practical Exercise:
○ Complete and present the capstone project.
● Assignment:
○ Submit the final project and present it to the class.

Proposal on Power BI
No ratings yet
Proposal on Power BI
9 pages
Power BI Guide
100% (3)
Power BI Guide
122 pages
Microsoft Fabric - James Serra - Public
No ratings yet
Microsoft Fabric - James Serra - Public
54 pages
DANLC Course Content
No ratings yet
DANLC Course Content
8 pages
Data Analytics Program 3
No ratings yet
Data Analytics Program 3
11 pages
DP 203SQL, Python, Power BI, Data Warehousing
No ratings yet
DP 203SQL, Python, Power BI, Data Warehousing
7 pages
Data Analytics Courses Provided by Need Data Community
No ratings yet
Data Analytics Courses Provided by Need Data Community
10 pages
Business Analytics Master
No ratings yet
Business Analytics Master
19 pages
Data Analyst or Bussiness Analyst
No ratings yet
Data Analyst or Bussiness Analyst
7 pages
Data Analytics Updated SA
No ratings yet
Data Analytics Updated SA
12 pages
Data Analytics & Engineering Bootcamp by aiexper career
No ratings yet
Data Analytics & Engineering Bootcamp by aiexper career
12 pages
Data Science 8752
No ratings yet
Data Science 8752
28 pages
An9kKtPhBtSmwO8GJT4tTN4f5ZnJ09tNx-CHSOtkaanNJLCuH2Gg3SrkSNb8do8RUvkPqlHanIeebxXd-aOz0JM48NyfXLd06LeEVoDyMVXbNByqQbaWw85N4qjonWM
No ratings yet
An9kKtPhBtSmwO8GJT4tTN4f5ZnJ09tNx-CHSOtkaanNJLCuH2Gg3SrkSNb8do8RUvkPqlHanIeebxXd-aOz0JM48NyfXLd06LeEVoDyMVXbNByqQbaWw85N4qjonWM
13 pages
Data Analyst Learning Path
No ratings yet
Data Analyst Learning Path
10 pages
Power BI & Data Analytics (1)tydhgfc
No ratings yet
Power BI & Data Analytics (1)tydhgfc
5 pages
Advanced Certification in Data Science (213 hours ) 75,999- (1)
No ratings yet
Advanced Certification in Data Science (213 hours ) 75,999- (1)
5 pages
Power BI SQL
No ratings yet
Power BI SQL
8 pages
Data Analytics Master Course Brochure
No ratings yet
Data Analytics Master Course Brochure
28 pages
Data Science and Data Analytics Brochure Welcome To RISE INSTITUTE 1
No ratings yet
Data Science and Data Analytics Brochure Welcome To RISE INSTITUTE 1
13 pages
Data Superstar Placement Assurance Program Brochure
No ratings yet
Data Superstar Placement Assurance Program Brochure
22 pages
Advanced Certification Course in Microsoft Power BI
No ratings yet
Advanced Certification Course in Microsoft Power BI
31 pages
Data Analytics Bootcamp Job-Ready Skills
No ratings yet
Data Analytics Bootcamp Job-Ready Skills
25 pages
Cientista de Dados - Curso
No ratings yet
Cientista de Dados - Curso
1 page
File 1700817831282
No ratings yet
File 1700817831282
4 pages
Data Analyst Roadmap 2024
No ratings yet
Data Analyst Roadmap 2024
14 pages
Course curriculum
No ratings yet
Course curriculum
7 pages
Data Analytics and Power BI Career Path Batch 3
No ratings yet
Data Analytics and Power BI Career Path Batch 3
33 pages
Planner-Combine
No ratings yet
Planner-Combine
10 pages
Data Analytics Duration
No ratings yet
Data Analytics Duration
18 pages
Geekster Data Science Brochure
No ratings yet
Geekster Data Science Brochure
18 pages
Complete Sylabus Advanced Excel, Power BI, SQL
No ratings yet
Complete Sylabus Advanced Excel, Power BI, SQL
7 pages
Data Science Slybus
No ratings yet
Data Science Slybus
23 pages
Data
No ratings yet
Data
14 pages
PowerBI Course Content Raj Cloud Technologies
No ratings yet
PowerBI Course Content Raj Cloud Technologies
6 pages
Data Science
100% (1)
Data Science
13 pages
Learn Data in 2024
No ratings yet
Learn Data in 2024
7 pages
Da Syllabus New
No ratings yet
Da Syllabus New
31 pages
Data Analyst - Outline
No ratings yet
Data Analyst - Outline
8 pages
SQL Description
No ratings yet
SQL Description
7 pages
Advanced Certification in Data Analytics ( 108 hours ) 44,999-
No ratings yet
Advanced Certification in Data Analytics ( 108 hours ) 44,999-
3 pages
Data Analytics Certificate Course
No ratings yet
Data Analytics Certificate Course
15 pages
Data Analyst - Bhavesh Kumar
No ratings yet
Data Analyst - Bhavesh Kumar
3 pages
Adv Data Analytics Training
No ratings yet
Adv Data Analytics Training
15 pages
4a98c153-6306-4946-99d7-3db7233e3567_Data-Analyst-Roadmap
No ratings yet
4a98c153-6306-4946-99d7-3db7233e3567_Data-Analyst-Roadmap
6 pages
Data Analytics Online Certification (1) (1) (4)
No ratings yet
Data Analytics Online Certification (1) (1) (4)
10 pages
Data Analyst Bootcamp 19
No ratings yet
Data Analyst Bootcamp 19
10 pages
Data Analytics (1)
No ratings yet
Data Analytics (1)
22 pages
Data Analytics Chennai
No ratings yet
Data Analytics Chennai
20 pages
Data Science - Toc (1)
No ratings yet
Data Science - Toc (1)
5 pages
Course Content - SQL MSPBI
No ratings yet
Course Content - SQL MSPBI
12 pages
Data-Analyst-Training
No ratings yet
Data-Analyst-Training
9 pages
Data Analyst Course 2
No ratings yet
Data Analyst Course 2
11 pages
ALX Data Analytics Program Description
No ratings yet
ALX Data Analytics Program Description
6 pages
Data Analyst Roadmap 2025?
No ratings yet
Data Analyst Roadmap 2025?
11 pages
IBA - SYLLABUS - Data Manipulation and Visualization
No ratings yet
IBA - SYLLABUS - Data Manipulation and Visualization
5 pages
Power BI 101 - Brochure New
No ratings yet
Power BI 101 - Brochure New
8 pages
Data Analyst Roadmap -Mansi Patel
No ratings yet
Data Analyst Roadmap -Mansi Patel
10 pages
360 DigiTMG Data Analytics Course Syllabus
No ratings yet
360 DigiTMG Data Analytics Course Syllabus
22 pages
IC Outlines for Data Science Machine Learning
No ratings yet
IC Outlines for Data Science Machine Learning
19 pages
Tech Launch Program Data science
No ratings yet
Tech Launch Program Data science
22 pages
Data Analytics With Excel
No ratings yet
Data Analytics With Excel
3 pages
Access 2016: Up To Speed
From Everand
Access 2016: Up To Speed
R.M. Hyttinen
5/5 (2)
SQL Server 2014 Development Essentials
From Everand
SQL Server 2014 Development Essentials
Basit A. Masood-Al-Farooq
4.5/5 (2)
Beginning Microsoft Power BI: A Practical Guide to Self-Service Data Analytics 3rd Edition Dan Clark - Download the full ebook now for a seamless reading experience
100% (1)
Beginning Microsoft Power BI: A Practical Guide to Self-Service Data Analytics 3rd Edition Dan Clark - Download the full ebook now for a seamless reading experience
70 pages
MB-700-Demo
No ratings yet
MB-700-Demo
43 pages
Report
No ratings yet
Report
16 pages
Evolving The Microsoft Partner Network Programs: Solutions Partner For Data & AI (Azure) Walking Deck
No ratings yet
Evolving The Microsoft Partner Network Programs: Solutions Partner For Data & AI (Azure) Walking Deck
27 pages
PowerBI_Banking_Project
No ratings yet
PowerBI_Banking_Project
14 pages
ShishankResume
No ratings yet
ShishankResume
1 page
pl-900 1
No ratings yet
pl-900 1
43 pages
Trainity
No ratings yet
Trainity
12 pages
Power BI MCQ
No ratings yet
Power BI MCQ
31 pages
Lab 0 - Prerequisites and Document Structure
No ratings yet
Lab 0 - Prerequisites and Document Structure
6 pages
power-bi-developer-embedded
No ratings yet
power-bi-developer-embedded
343 pages
Raghav Khandelwal - Batch 2025 - B.tech - ECE
No ratings yet
Raghav Khandelwal - Batch 2025 - B.tech - ECE
2 pages
Curriculum MBA- HRA
No ratings yet
Curriculum MBA- HRA
14 pages
Lucky
No ratings yet
Lucky
6 pages
Pallavi d Resume
No ratings yet
Pallavi d Resume
1 page
Power BI Desktop End User Guide
No ratings yet
Power BI Desktop End User Guide
21 pages
Dashboard Reports Powerbi Sharepoint
No ratings yet
Dashboard Reports Powerbi Sharepoint
15 pages
Dr. Vishal Shukla
No ratings yet
Dr. Vishal Shukla
9 pages
Lakehouse End-To-End Scenario - Overview and Architecture - Microsoft Fabric - Microsoft Learn
No ratings yet
Lakehouse End-To-End Scenario - Overview and Architecture - Microsoft Fabric - Microsoft Learn
8 pages
Answer Summary Powerbi
No ratings yet
Answer Summary Powerbi
65 pages
Microsoft Transcender pl-300 PDF 2022-Sep-16 by Giles 162q Vce
No ratings yet
Microsoft Transcender pl-300 PDF 2022-Sep-16 by Giles 162q Vce
21 pages
Fundamentals of Data Analytics (In-Person) Course Outline
No ratings yet
Fundamentals of Data Analytics (In-Person) Course Outline
3 pages
Becoming a Power BI Professional
No ratings yet
Becoming a Power BI Professional
4 pages
Pro Power BI Theme Creation: JSON Stylesheets for Automated Dashboard Formatting 2nd Edition Adam Aspin 2024 scribd download
100% (3)
Pro Power BI Theme Creation: JSON Stylesheets for Automated Dashboard Formatting 2nd Edition Adam Aspin 2024 scribd download
76 pages
3rd Unit - DA
No ratings yet
3rd Unit - DA
20 pages
PDF Microsoft Excel Pivot Table Data Crunching (Office 2021 and Microsoft 365) (Business Skills) 1st Edition Bill Jelen download
100% (1)
PDF Microsoft Excel Pivot Table Data Crunching (Office 2021 and Microsoft 365) (Business Skills) 1st Edition Bill Jelen download
65 pages
Fin Irjmets1711372102
No ratings yet
Fin Irjmets1711372102
3 pages

Data science curriculum

Uploaded by

Data science curriculum

Uploaded by

FOUNDATION

INTRODUCTION TO DATA SCIENCE

- Where data science fits into today’s society.

VERSION CONTROL - GIT AND GITHUB

1. Getting Started with Power BI:

- Understanding Power BI Desktop, Power BI Service, and Power BI Mobile

- Basic navigation and interface of Power BI Desktop

- Data cleaning and transformation using Power Query Editor

- Merging and appending queries

- Data types and error handling

- Creating relationships between tables

- Understanding and using star and snowflake schemas

- Managing relationships (one-to-one, one-to-many, many-to-many)

- Using calculated columns and tables

4. DAX (Data Analysis Expressions):

- Basics of DAX syntax and functions

- Creating calculated columns and measures

- Common DAX functions (SUM, COUNT, AVERAGE, MIN, MAX)

- Time intelligence functions (DATEADD, DATESYTD, SAMEPERIODLASTYEAR)

- Advanced DAX functions (CALCULATE, ALL, FILTER, RELATED)

- Using slicers for filtering data

- Creating and customizing tables and matrices

- Using maps and geographical data visualizations

- Custom visualizations from the marketplace

- Using bookmarks and selections for interactive reports

- Creating drill-through and drill-down reports

- Using tooltips for enhanced data presentation

- Implementing conditional formatting

7. Reports and Dashboards:

- Designing report layouts and themes

- Creating and managing dashboards in Power BI Service

- Pinning visuals to dashboards

- Using Q&amp;A feature for natural language queries

- Publishing reports to Power BI Service

- Understanding workspaces, apps, and content packs

- Sharing reports and dashboards with stakeholders

- Collaborating with team members

- Integrating Power BI reports into applications

- Using Power BI REST API for automation

- Implementing row-level security (RLS)

- Managing roles and permissions

- Understanding and applying data protection and compliance measures

11. Performance Optimization:

- Optimizing data models for performance

- Using Performance Analyzer tool

- Best practices for efficient report design

12. Advanced Analytics:

- Using AI visuals (Key Influencers, Decomposition Tree, Q&amp;A Visual)

- Integrating R and Python scripts in Power BI

- Implementing what-if parameters for scenario analysis

13. Power BI Integration:

- Integrating with third-party tools and data sources

- Using Power Automate for workflow automation

- Managing Power BI gateway for on-premises data sources

- Monitoring usage and performance

- Implementing governance and best practices for organisation-wide usage

15. Power BI Community and Resources:

- Participating in Power BI community forums and events

- Utilising Power BI documentation and learning resources

- Staying updated with new features and updates

PYTHON FOR DATA SCIENCE

Streamlit Components and Layouts

○ Advanced Layouts and Widgets

Introduction to Big Data

○ What is Big Data?

Data Wrangling with PySpark

Data Visualization for Big Data

Connecting Streamlit with Big Data Storage

Machine Learning on Big Data

Advanced Streamlit Features

○ Custom Components in Streamlit

Big Data Project Development

Stock Portfolio Analysis

Time Series Forecasting

E-commerce Sales Analysis

Data Cleaning and Transformation Tool

- Using Q&A feature for natural language queries

- Using AI visuals (Key Influencers, Decomposition Tree, Q&A Visual)