Syllabus: 1. Introduction To Data Science
What is data science? Relation to data mining, machine learning, big data and statistics
Motivating examples
Why is it interesting?
Practical information
Simple visualizations
o Histograms
Boxplots
Scatterplots
Time series
Spatial data
Case studies
o X & Y examples
Medical data
Examples
Prediction algorithms
o Decision trees
Rule learners
Linear/logistic regression
Combining classifiers
Experimental setup
o Training, tuning, test data
o Interpretation of results
5. DATA ENGINEERING
Attribute selection
o Filter methods
o Wrapper methods
Data discretization
Unsupervised discretization
Supervised discretization
Data transformations
Exercises
Introduction
o Probabilities
o Naive Bayes
o Bayesian Networks
o Graphical representation
Temporal models
o Markov Chains
In detail: Apriori
Clustering
o What is clustering?
Eve, the Pharmaceutical Robot Scientist: Data Science for Drug Discovery
Data science for sports analytics
9. CHALLENGE
Introduction
Hands-on by participants
Discussion of results