Data Science Seminar
Data Science Seminar
Source: CapGeminii
Source: McKinsey Global Institute – The Age of Analytics: Competing in a Data Driven World (2016)
Source: McKinsey Global Institute – The Age of Analytics: Competing in a Data Driven World (2016)
$16.9 Billion
Industry
CS 561, Lecture 1
CS 561, Lecture 1
Business Data
Analytics Archiving
Data
y y
x x
y y
x x
y y
x x
y y
x x
y y
x x
- 0
-0.75 -0.25 0.25 0.75 1
1
inverse direct
perfect no relation perfect
linear linear
correlation correlation
x x x
r = -1 r = -.6 r=
y 0
y
(4) (5)
x x
r = +.3 r = +1
where:
r = Sample correlation coefficient
n = Sample size
x = Value of the independent variable
y = Value of the dependent variable
Tree
Height,
y
Correlation between
Tree Height and Trunk Diameter
9-42
y
Observed Value
of y for xi
Slope = β1
Predicted Value
of y for xi
Intercept = β0
x x
i
Value of X for
observation i
Dependent Independent
variable variables
The Jupyter Notebook App can be executed on a local desktop requiring no internet
access or it can be installed on a remote server and accessed through the internet.
Best of all, as part of the open source Project Jupyter, they are completely free.
56
QUESTIONS???