1 Introduction
Introduction
• Data science
• Statistics
• Exercises
Data science
The art of creating meaning from data
Science
Model formulation
Model evaluation
www.instagram.com/sandserifcomics/
www.oak-tree.tech/blog/ml-models
Statistics
The art of creating meaning from data
and quantifying its associated uncertainty
Confidence intervals
Estimation theory
Approximate inference
Frequentist statistics
Bayesian filtering
Decision theory
• Statistical procedures are designed to have good long-run frequency properties
and are typically assessed by studying their sampling distributions.
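One way to make the long-run frequency idea concrete is a small simulation. The sketch below repeats an experiment many times and checks how often a 95% t-interval for the mean covers the true value. The Gaussian data model, the parameter values, and all variable names are assumptions chosen for illustration only.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Hypothetical setup: Gaussian data with a known true mean.
true_mean, true_sd = 2.0, 1.5
n, n_repeats = 30, 5000

# Each row is one repetition of the experiment with n observations.
samples = rng.normal(true_mean, true_sd, size=(n_repeats, n))

# Sampling distribution of the sample mean, one value per repetition.
means = samples.mean(axis=1)
sems = samples.std(axis=1, ddof=1) / np.sqrt(n)

# 95% t-interval for the mean in every repetition.
t_crit = stats.t.ppf(0.975, df=n - 1)
lower = means - t_crit * sems
upper = means + t_crit * sems

# Long-run frequency property: fraction of intervals covering the true mean.
coverage = np.mean((lower <= true_mean) & (true_mean <= upper))

print("empirical sd of the sample mean:", means.std(ddof=1))
print("theoretical sd of the sample mean:", true_sd / np.sqrt(n))
print("empirical coverage of the 95% interval:", coverage)
```

The empirical coverage should be close to the nominal 95%, and the spread of `means` should be close to `true_sd / sqrt(n)`.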
• Probabilistic statements about parameters are made with the help of probability
distributions, from which further inferences, such as point or interval estimates,
can be derived.
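A minimal sketch of this idea, assuming a Beta-Binomial model that is not part of the slides: the unknown success probability of a Bernoulli experiment gets a Beta prior, the posterior is again a Beta distribution, and point and interval estimates are read off that distribution. The data values and the uniform prior are hypothetical.

```python
from scipy import stats

# Hypothetical data: 7 successes in 20 Bernoulli trials.
successes, trials = 7, 20

# Assumed prior: uniform Beta(1, 1) on the success probability.
prior_a, prior_b = 1.0, 1.0

# Conjugate update: the posterior is Beta(a + successes, b + failures).
post_a = prior_a + successes
post_b = prior_b + (trials - successes)
posterior = stats.beta(post_a, post_b)

# Point estimate: posterior mean.
posterior_mean = posterior.mean()

# Interval estimate: equal-tailed 95% credible interval.
ci_low, ci_high = posterior.interval(0.95)

print("posterior mean:", posterior_mean)
print("95% credible interval:", (ci_low, ci_high))
```

Other summaries, such as the posterior median via `posterior.median()`, follow from the same distribution object.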
Statistics for Data Science
Key references
• Leonhard Held, Daniel Sabanés Bové: Applied Statistical Inference. Likelihood and Bayes
Course components
Component Aims
• Binary feedback will be provided only on the Tuesday before the presentation
• The programming exercise pool provided with the lecture slides is final
Exercises
Study questions
1. Give a definition of Data Science.
2. Give a definition of Statistics.
3. Name three central postulates of Probability theory.
4. Name three central postulates of Frequentist inference.
5. Name three scientists involved in the development of Frequentist statistics.
6. Name three central postulates of Bayesian inference.
7. Name three scientists involved in the development of Bayesian statistics.
8. Name five typical topics in Statistics.
9. Name three topics commonly discussed in Machine Learning.
10. Name three topics commonly discussed in Artificial Intelligence.
Programming exercises
1. Sample a univariate Gaussian using scipy.stats.
2. Evaluate the PDF of a univariate Gaussian using scipy.stats.
3. Visualize the PDF of a univariate Gaussian and a normalized histogram of samples
drawn from a univariate Gaussian with identical parameters, overlaid in a single
plot using Matplotlib.
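A possible starting point for these exercises, not a reference solution: the sketch below samples a univariate Gaussian with scipy.stats, evaluates its PDF on a grid, and computes a normalized histogram numerically with NumPy so that it can be compared to the PDF. The parameter values, the sample size, and the bin count are arbitrary assumptions.

```python
import numpy as np
from scipy import stats

# Hypothetical parameters of the univariate Gaussian.
mu, sigma = 1.0, 2.0
gaussian = stats.norm(loc=mu, scale=sigma)

# Exercise 1: draw samples (fixed seed for reproducibility).
samples = gaussian.rvs(size=10_000, random_state=0)

# Exercise 2: evaluate the PDF on a grid around the mean.
x = np.linspace(mu - 4 * sigma, mu + 4 * sigma, 201)
pdf_values = gaussian.pdf(x)

# Exercise 3 (numerical part): normalized histogram, so bar areas sum to 1.
heights, edges = np.histogram(samples, bins=40, density=True)
centers = 0.5 * (edges[:-1] + edges[1:])

# Largest gap between the histogram heights and the PDF at the bin centers.
max_gap = np.max(np.abs(heights - gaussian.pdf(centers)))

print("sample mean:", samples.mean())
print("sample standard deviation:", samples.std(ddof=1))
print("largest histogram-to-PDF gap:", max_gap)
```

For the plot itself, Matplotlib's `plt.hist(samples, bins=40, density=True)` followed by `plt.plot(x, pdf_values)` draws both on the same axes.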