0% found this document useful (0 votes)

58 views4 pages

2-Scatterplots and Correlation

This document discusses scatterplots and correlation. It explains that scatterplots show the relationship between two numeric variables by plotting each data point based on its values for each variable. It provides examples of creating scatterplots in matplotlib and seaborn that show negative and positive correlations. It also discusses fitting a regression line to the scatterplot and transforming the data when the relationship is not linear.

Uploaded by

Lionel Yepdieu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

58 views4 pages

2-Scatterplots and Correlation

Uploaded by

Lionel Yepdieu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Scatterplots and Correlation

classroom.udacity.com/nanodegrees/nd089/parts/8de94dee-7635-43b3-9d11-5e4583f22ce3/modules/c7f6b93a-
28e1-46b3-97c2-997a7eeddbf3/lessons/0491d74e-dcd8-4700-a971-a7f1b0a26ddb/concepts/9d1316b3-f339-4d52-
b63f-91994aefdd40

Watch Video At: https://youtu.be/wqMwTDVT9_Y

Watch Video At: https://youtu.be/wBDC5AmYgyg

Scatterplots

1/4
If we want to inspect the relationship between two numeric variables, the standard choice
of plot is the scatterplot. In a scatterplot, each data point is plotted individually as a
point, its x-position corresponding to one feature value and its y-position corresponding
to the second.

matplotlib.pyplot.scatter()
One basic way of creating a scatterplot is through Matplotlib's scatter function:

Example 1 a. Scatter plot showing negative correlation between two

variables

# TO DO: Necessary import

# Read the CSV file

fuel_econ = pd.read_csv('fuel_econ.csv')
fuel_econ.head(10)

# Scatter plot
plt.scatter(data = fuel_econ, x = 'displ', y = 'comb');
plt.xlabel('Displacement (1)')
plt.ylabel('Combined Fuel Eff. (mpg)')

In the example above, the relationship between the two variables is negative because as
higher values of the x-axis variable are increasing, the values of the variable plotted on the
y-axis are decreasing.

Alternative Approach - seaborn.regplot()

Seaborn's regplot() function combines scatterplot creation with regression function
fitting:

Example 1 b. Scatter plot showing negative correlation between two

variables

2/4
sb.regplot(data = fuel_econ, x = 'displ', y = 'comb');
plt.xlabel('Displacement (1)')
plt.ylabel('Combined Fuel Eff. (mpg)')

The basic function parameters, "data", "x", and "y" are the same for regplot as they are
for matplotlib's scatter .

The regression line in a scatter plot showing a negative correlation between the two
variables.

Example 2. Scatter plot showing a positive correlation between two

variables

Let's consider another plot shown below that shows a positive correlation between two
variables.

The regression line in a scatter plot showing a positive correlation between the two
variables.

3/4
In the scatter plot above, by default, the regression function is linear and includes a
shaded confidence region for the regression estimate. In this case, since the trend looks
like a \text{log}(y) \propto xlog(y)∝x relationship (that is, linear increases in the value of
x are associated with linear increases in the log of y), plotting the regression line on the
raw units is not appropriate. If we don't care about the regression line, then we could set
fit_reg = False in the regplot function call.

You can even plot the regression line on the transformed data as shown in the example
below. For transformation, use a similar approach as you've learned in the last lesson.

Example 3. Plot the regression line on the transformed data

def log_trans(x, inverse = False):

if not inverse:
return np.log10(x)
else:
return np.power(10, x)

sb.regplot(fuel_econ['displ'], fuel_econ['comb'].apply(log_trans))
tick_locs = [10, 20, 50, 100]
plt.yticks(log_trans(tick_locs), tick_locs);

Note - In this example, the x- and y- values sent to regplot are set directly as Series,
extracted from the dataframe.

Regression line on a scattered plot based on the log-transformed data

Supporting Materials

fuel_econ.csv

4/4

Lec 20
No ratings yet
Lec 20
24 pages
Module1 DS
No ratings yet
Module1 DS
61 pages
Topic4 Linear Models
No ratings yet
Topic4 Linear Models
72 pages
R-Unit 5
No ratings yet
R-Unit 5
76 pages
Co 2 Multivariate Analysis
No ratings yet
Co 2 Multivariate Analysis
71 pages
Ds Practical
No ratings yet
Ds Practical
25 pages
Chapt04 BPS
No ratings yet
Chapt04 BPS
26 pages
Ventures Regression
No ratings yet
Ventures Regression
19 pages
Math10 Week9.3
No ratings yet
Math10 Week9.3
15 pages
Ibrokhimovkhusnidin
No ratings yet
Ibrokhimovkhusnidin
9 pages
10 Must-Know Seaborn Visualization Plots For Multivariate Data Analysis in Python - by Susan Maina - Towards Data Science
No ratings yet
10 Must-Know Seaborn Visualization Plots For Multivariate Data Analysis in Python - by Susan Maina - Towards Data Science
39 pages
List of Functions
No ratings yet
List of Functions
7 pages
Calculating Pearson Correlation Coefficient in Python With Numpy
No ratings yet
Calculating Pearson Correlation Coefficient in Python With Numpy
6 pages
What Is A Scatter Plot?
No ratings yet
What Is A Scatter Plot?
3 pages
Lecture 4
No ratings yet
Lecture 4
60 pages
MDM 05-1
No ratings yet
MDM 05-1
2 pages
Scatter PLOTS - 20 Jan 2023
No ratings yet
Scatter PLOTS - 20 Jan 2023
23 pages
MDM - 05 1 1
No ratings yet
MDM - 05 1 1
2 pages
Correlation and Regression
No ratings yet
Correlation and Regression
2 pages
MDM - 04 1 1
No ratings yet
MDM - 04 1 1
2 pages
Solutions For QB3
No ratings yet
Solutions For QB3
14 pages
Quantitative Techniques For Management PDF
88% (8)
Quantitative Techniques For Management PDF
507 pages
Document Sans Titre
No ratings yet
Document Sans Titre
1 page
Unit 6 - Scatter Plots & Data Analysis
No ratings yet
Unit 6 - Scatter Plots & Data Analysis
13 pages
Data Science Unit 2-11-08 2023
No ratings yet
Data Science Unit 2-11-08 2023
78 pages
Ex 4
No ratings yet
Ex 4
4 pages
Chapter2-ESTA3042 2020S2
No ratings yet
Chapter2-ESTA3042 2020S2
80 pages
Bi Variate 1
No ratings yet
Bi Variate 1
75 pages
Scatterplots and Regression
No ratings yet
Scatterplots and Regression
17 pages
Data Science and Analtics Laboratory
No ratings yet
Data Science and Analtics Laboratory
21 pages
DA Manual - Part B
No ratings yet
DA Manual - Part B
13 pages
How To Interpret A Correlation Coefficient R
No ratings yet
How To Interpret A Correlation Coefficient R
2 pages
A Scatter Plot
No ratings yet
A Scatter Plot
5 pages
Unit 2
No ratings yet
Unit 2
36 pages
Bivariate EDA and Regression Analysis
No ratings yet
Bivariate EDA and Regression Analysis
61 pages
Statistics Regression Final Project
100% (2)
Statistics Regression Final Project
12 pages
Ad3411 - Data Science and Analytics Laboratory
No ratings yet
Ad3411 - Data Science and Analytics Laboratory
26 pages
Bivariate Analysis
No ratings yet
Bivariate Analysis
24 pages
BA 216 Lecture 5 Notes
No ratings yet
BA 216 Lecture 5 Notes
31 pages
02 Correlation Coefficient and The Residual
No ratings yet
02 Correlation Coefficient and The Residual
10 pages
Dsa Lab Manual
No ratings yet
Dsa Lab Manual
17 pages
Code Shabab Error 7
No ratings yet
Code Shabab Error 7
5 pages
Scatter Diagram
No ratings yet
Scatter Diagram
6 pages
Notes 2 - Scatterplots and Correlation
No ratings yet
Notes 2 - Scatterplots and Correlation
6 pages
Ch. 7: Scatterplots, Association, and Correlation
No ratings yet
Ch. 7: Scatterplots, Association, and Correlation
4 pages
Lec 19
No ratings yet
Lec 19
14 pages
Machine Learning
No ratings yet
Machine Learning
30 pages
DVPD Final Lab Word PDF
No ratings yet
DVPD Final Lab Word PDF
93 pages
8537ADS Experiment 03
No ratings yet
8537ADS Experiment 03
4 pages
16 Mark Ds
No ratings yet
16 Mark Ds
18 pages
MDM4U Unit1 CorrelationSE
No ratings yet
MDM4U Unit1 CorrelationSE
3 pages
Unit V Notes
No ratings yet
Unit V Notes
11 pages
Alg 2.2 2.6 Originals
No ratings yet
Alg 2.2 2.6 Originals
20 pages
Python Machine Learning Linear Regression
No ratings yet
Python Machine Learning Linear Regression
1 page
TEACHING NOTES Section 6
No ratings yet
TEACHING NOTES Section 6
6 pages
Exploratory PDF
No ratings yet
Exploratory PDF
20 pages
Set 3 K@mpoi Algebra 2022 - Jawapan
No ratings yet
Set 3 K@mpoi Algebra 2022 - Jawapan
12 pages
Ridge Regression
No ratings yet
Ridge Regression
82 pages
Ad3411 - Student
No ratings yet
Ad3411 - Student
27 pages
Schedule Risk Analysis
No ratings yet
Schedule Risk Analysis
40 pages
Semi-Automated Exploratory Data Analysis (EDA) in Python - by Destin Gong - Mar, 2021 - Towards Data
No ratings yet
Semi-Automated Exploratory Data Analysis (EDA) in Python - by Destin Gong - Mar, 2021 - Towards Data
3 pages
FDS Iat-2 Part-B
No ratings yet
FDS Iat-2 Part-B
4 pages
AnsSol JEEMain 2023 PH 2-10-04 2023 Evening Paper
100% (1)
AnsSol JEEMain 2023 PH 2-10-04 2023 Evening Paper
23 pages
Variation of Velocity and Acceleration in Suction and Delivery Pipes Due To Acceleration of Piston
100% (1)
Variation of Velocity and Acceleration in Suction and Delivery Pipes Due To Acceleration of Piston
9 pages
Errata Pages For Statics
No ratings yet
Errata Pages For Statics
10 pages
The IMA Volumes in Mathematics and Its Applications: Avner Friedman Willard Miller, JR
No ratings yet
The IMA Volumes in Mathematics and Its Applications: Avner Friedman Willard Miller, JR
172 pages
KNN (K Nearest Neighbor)
No ratings yet
KNN (K Nearest Neighbor)
21 pages
Ordered Pair:-An Ordered Pair Consist of Two Elements in A Fixed Order
No ratings yet
Ordered Pair:-An Ordered Pair Consist of Two Elements in A Fixed Order
19 pages
Thermal Deformation Analysis of Automotive Disc Brake Squeal
No ratings yet
Thermal Deformation Analysis of Automotive Disc Brake Squeal
26 pages
Combinations With Repetitions
0% (1)
Combinations With Repetitions
102 pages
ANSYS CFX Tutorials
No ratings yet
ANSYS CFX Tutorials
610 pages
New Microsoft Office PowerPoint Presentation
No ratings yet
New Microsoft Office PowerPoint Presentation
27 pages
Adaptive Filter Application in Echo Cancellation System and Implementation Using FPGA
No ratings yet
Adaptive Filter Application in Echo Cancellation System and Implementation Using FPGA
13 pages
Experiment Standing Wave
No ratings yet
Experiment Standing Wave
12 pages
Module 4 - Lecture Notes Engineering Design-Pages-15-18,3-13,1
No ratings yet
Module 4 - Lecture Notes Engineering Design-Pages-15-18,3-13,1
16 pages
UT 1 Science Class 10
No ratings yet
UT 1 Science Class 10
6 pages
Practice B: Surface Area of Prisms and Cylinders
No ratings yet
Practice B: Surface Area of Prisms and Cylinders
10 pages
Unit 4 Aam
No ratings yet
Unit 4 Aam
26 pages
2.2 Geometric Patterns
No ratings yet
2.2 Geometric Patterns
42 pages
Normal Modes - Rigid Element Analysis With RBE2 and CONM2
No ratings yet
Normal Modes - Rigid Element Analysis With RBE2 and CONM2
22 pages
Fibonacci Sequence
No ratings yet
Fibonacci Sequence
6 pages
LaSalleCollege F3 Maths Final Exam Paper 1 Section AB 2012 13
No ratings yet
LaSalleCollege F3 Maths Final Exam Paper 1 Section AB 2012 13
8 pages
Rolling Regression Theory
No ratings yet
Rolling Regression Theory
30 pages
CMPS161 Class Notes Chap 03
No ratings yet
CMPS161 Class Notes Chap 03
20 pages
LAS DRAWING-Q2-Classification of Drawing Tools
No ratings yet
LAS DRAWING-Q2-Classification of Drawing Tools
3 pages
5.1 Oscillations (Part 1)
No ratings yet
5.1 Oscillations (Part 1)
2 pages
Abstract Algebra - Proof of Prime and Irreducible Equivalences in PIDs - Mathematics Stack Exchange
No ratings yet
Abstract Algebra - Proof of Prime and Irreducible Equivalences in PIDs - Mathematics Stack Exchange
2 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
From Everand
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
Fouad Sabry
No ratings yet

2-Scatterplots and Correlation

Uploaded by

2-Scatterplots and Correlation

Uploaded by

Scatterplots and Correlation

Watch Video At: https://youtu.be/wqMwTDVT9_Y

Watch Video At: https://youtu.be/wBDC5AmYgyg

Example 1 a. Scatter plot showing negative correlation between two

# TO DO: Necessary import

# Read the CSV file

Alternative Approach - seaborn.regplot()

Example 1 b. Scatter plot showing negative correlation between two

Example 2. Scatter plot showing a positive correlation between two

Example 3. Plot the regression line on the transformed data

def log_trans(x, inverse = False):

Regression line on a scattered plot based on the log-transformed data

You might also like