
Data Science_program (1)

Uploaded by

kristi spahi
Copyright
© All Rights Reserved
Data Science Program

l.p.   no. of hours   form   topic   scope
1   3   stationary   Data Scientist - introduction
- Introduction to Data Science: basic concepts, working environment, general knowledge about Data Science
           video        GIT - video
2   7   stationary   System tools and GIT version control system
- working with the system console
- vim editor
- practical exercises
- git version control system

3   28   stationary   Python in Data Science
- Python Basics
- Python object-oriented programming
- Python technology including Google Colab
- Intermediate Python

4   14   stationary   Statistics and probability
I. Elements of linear algebra - using NumPy
- scalars, vectors, matrices, tensors
- matrix and vector operations (multiplication, addition)
II. Probability and information theory
- probability
- random variables
- probability distributions
- conditional probability and Bayes' theorem
III. Descriptive statistics
IV. Statistical tests
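
As an illustration of the "conditional probability and Bayes' theorem" item, here is a minimal sketch in plain Python. The function name bayes_posterior and the numbers (1% prior, 90% sensitivity, 5% false-positive rate) are assumptions chosen for this example and are not taken from the course material.

```python
def bayes_posterior(prior, likelihood, false_positive_rate):
    """Return P(H | E) given P(H), P(E | H) and P(E | not H)."""
    # Law of total probability: P(E) = P(E|H) * P(H) + P(E|not H) * P(not H)
    evidence = likelihood * prior + false_positive_rate * (1.0 - prior)
    # Bayes' theorem: P(H|E) = P(E|H) * P(H) / P(E)
    return likelihood * prior / evidence


# Hypothetical screening test for a rare condition (made-up numbers).
posterior = bayes_posterior(prior=0.01, likelihood=0.90, false_positive_rate=0.05)
print(f"P(condition | positive test) = {posterior:.3f}")
```

With these made-up numbers the posterior is roughly 15%, far below the 90% sensitivity, because the condition itself is rare.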

5   14   stationary   Processing of data sets
- data frames - pandas library
- working with data, basic operations
- feature engineering
- pandas profiling
- working with files - serialization
- data download from API
- SQL language and databases
- database programming

6   14   stationary   Data visualisation
- data visualization
- matplotlib and seaborn
- ggplot and plotly
- descriptive statistics

7   14   stationary   Machine Learning in practice: regression problems in supervised learning
- what is machine learning
- the ability of machines to learn
- supervised learning
- unsupervised learning
- incremental learning
- the main problems of machine learning
- supervised learning - regression problems
a. linear regression
b. least squares method
c. gradient descent
d. polynomial regression
e. decision tree regression
f. model evaluation methods (ROC curve, cross-validation)
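
The "least squares method" and "gradient descent" items can be compared on a single toy problem. The sketch below is plain Python under stated assumptions: the function names, the learning rate, the step count, and the data (points lying exactly on y = 2x + 1) are all invented for illustration.

```python
def fit_least_squares(xs, ys):
    """Closed-form simple linear regression: slope = S_xy / S_xx."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    slope = s_xy / s_xx
    return slope, mean_y - slope * mean_x


def fit_gradient_descent(xs, ys, learning_rate=0.05, steps=5000):
    """Minimise mean squared error by repeated gradient steps."""
    n = len(xs)
    slope, intercept = 0.0, 0.0
    for _ in range(steps):
        errors = [slope * x + intercept - y for x, y in zip(xs, ys)]
        # Partial derivatives of the mean squared error.
        grad_slope = 2.0 / n * sum(e * x for e, x in zip(errors, xs))
        grad_intercept = 2.0 / n * sum(errors)
        slope -= learning_rate * grad_slope
        intercept -= learning_rate * grad_intercept
    return slope, intercept


# Toy data lying exactly on y = 2x + 1 (assumed for the example).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]

ls_slope, ls_intercept = fit_least_squares(xs, ys)
gd_slope, gd_intercept = fit_gradient_descent(xs, ys)
print(ls_slope, ls_intercept)
print(gd_slope, gd_intercept)
```

Both routes should land on approximately the same line. The closed form gives the answer in one pass, while gradient descent approaches it iteratively and depends on the chosen learning rate.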

8   7   Practical project (regression)
- data loading and processing
- EDA
- problem analysis
- presentation of models solving the problem
- creating a basic model
- comparison of several approaches and drawing conclusions

9   21   stationary/remote   Machine Learning in practice: classification problems in supervised learning
- logistic regression
- binary classification
- multi-class classification
- k-nearest neighbors
- support vector machines
- decision trees
- Naive Bayes
- model evaluation methods (ROC curve, cross-validation)
- xgboost

10   7   Practical project (classification)
- data loading and processing
- EDA
- problem analysis
- presentation of models solving the problem
- creating a basic model
- compare several approaches and draw conclusions

11   14   stationary   Machine Learning in practice: unsupervised learning
- curse of dimensionality, PCA decomposition
- data clustering - k-means algorithm
- image segmentation as a clustering problem
- data clustering - DBSCAN algorithm
- Gaussian mixture model
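
The "k-means algorithm" item alternates between two steps: assign each point to its nearest centroid, then move each centroid to the mean of its points. The plain-Python sketch below is only an illustration. The function names, the choice of the first k points as initial centroids, and the six sample points are assumptions, and library implementations typically use more careful initialisation.

```python
def squared_distance(a, b):
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def k_means(points, k, max_iterations=100):
    # Assumption: the first k points serve as the initial centroids,
    # which keeps the example deterministic.
    centroids = [tuple(p) for p in points[:k]]
    labels = [None] * len(points)
    for _ in range(max_iterations):
        # Assignment step: index of the nearest centroid for every point.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = tuple(
                    sum(coord) / len(members) for coord in zip(*members)
                )
    return labels, centroids


# Two visually separate groups of 2-D points (made-up data).
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
labels, centroids = k_means(points, k=2)
print(labels)
print(centroids)
```

On this data the first three points end up in one cluster and the last three in the other, even though the initial centroids both come from the first group.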

12   7   stationary   TensorFlow Library
- computational graphs
- creating simple models
- saving and loading models
- visualization of a computational graph
- practical project
- tf.data API

13   7   stationary   Introduction to artificial neural networks
- biological neuron
- artificial neuron - perceptron
- training neural networks
- introduction to deep learning
- deep neural networks
- neural network hyperparameters
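
For the "artificial neuron - perceptron" and "training neural networks" items, a single perceptron trained with the classic perceptron learning rule is about the smallest possible example. The sketch below is plain Python. The function names, the learning rate, the epoch count, and the AND-gate data set are assumptions made for illustration.

```python
def predict(weights, bias, inputs):
    # Step activation: fire (1) only when the weighted sum is positive.
    activation = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if activation > 0 else 0


def train_perceptron(samples, learning_rate=1.0, epochs=20):
    weights = [0.0] * len(samples[0][0])
    bias = 0.0
    for _ in range(epochs):
        for inputs, target in samples:
            # Perceptron rule: shift the weights towards the target
            # whenever the prediction is wrong (error is -1, 0 or 1).
            error = target - predict(weights, bias, inputs)
            weights = [
                w + learning_rate * error * x for w, x in zip(weights, inputs)
            ]
            bias += learning_rate * error
    return weights, bias


# Logical AND is linearly separable, so one perceptron can learn it.
and_gate = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias = train_perceptron(and_gate)
predictions = [predict(weights, bias, inputs) for inputs, _ in and_gate]
print(predictions)
```

A problem that is not linearly separable, such as XOR, cannot be learned this way, which is one motivation for the multi-layer networks listed above.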

14   7   stationary   Training deep neural networks
- problems of training deep neural networks
- vanishing/exploding gradient problem
- weights initialization
- optimizers
- regularization methods

15   14   stationary   Image processing - Computer Vision
- an overview of the architecture of the visual cortex
- construction of a digital image
- the concept of convolution operation
- CNN architecture
- convolution operation in the network
- CNN filters
- MaxPooling
- image classifier
- transfer learning
- OpenCV library
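
The "convolution operation" and "MaxPooling" items can be shown on a tiny grid of numbers. The sketch below uses plain Python lists instead of an image library. The function names, the 4x4 "image" and the 2x2 filter are assumptions for illustration, and the convolution is written without flipping the kernel, which is the usual convention in CNN layers.

```python
def conv2d_valid(image, kernel):
    """Slide the kernel over the image with no padding and stride 1."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    output = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            # Weighted sum of the patch currently under the kernel.
            total = 0
            for di in range(kh):
                for dj in range(kw):
                    total += image[i + di][j + dj] * kernel[di][dj]
            row.append(total)
        output.append(row)
    return output


def max_pool2d(feature_map, size=2):
    """Keep the maximum of each non-overlapping size x size window."""
    pooled = []
    for i in range(0, len(feature_map) - size + 1, size):
        row = []
        for j in range(0, len(feature_map[0]) - size + 1, size):
            window = [
                feature_map[i + di][j + dj]
                for di in range(size)
                for dj in range(size)
            ]
            row.append(max(window))
        pooled.append(row)
    return pooled


# Made-up 4x4 grayscale "image" and a 2x2 diagonal filter.
image = [
    [1, 2, 0, 1],
    [3, 1, 2, 0],
    [0, 2, 1, 3],
    [1, 0, 2, 1],
]
kernel = [
    [1, 0],
    [0, 1],
]

feature_map = conv2d_valid(image, kernel)
pooled = max_pool2d(image)
print(feature_map)
print(pooled)
```

The 2x2 filter turns the 4x4 input into a 3x3 feature map, while 2x2 max pooling halves each dimension of the input it is applied to.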

16   7   stationary   Practical project (image processing)
- data loading and processing
- EDA
- problem analysis
- presentation of models solving the problem
- creating a basic model
- compare several approaches and draw conclusions

17   14   stationary   Natural language processing
- an introduction to NLP
- working with files and operations on the text
- working with the spaCy library
- tokenization
- stemming
- lemmatization
- text classification

18   14   stationary   Working with sequences - recurrent neural networks
- the concept of recurrence
- sequence data examples
- recurrent neuron
- input and output sequences
- data preparation for the network
- the network's difficulty remembering long-term dependencies
- structure and application of LSTM and GRU cells
- embeddings, attention layer, natural language processing

19   21   stationary   Final project
- Design thinking - creating and inventing a solution to a problem based on DS.
- loading and processing of data from various sources
- EDA, understanding your data
- analysis of data problems
- presentation of models solving the problem
- creating a basic model
- improvement of the solution

Total hours: 234
