0% found this document useful (0 votes)

151 views15 pages

Exercise2 Solution

IE0005 Exercise solutions 2

Uploaded by

Derrick

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

151 views15 pages

Exercise2 Solution

IE0005 Exercise solutions 2

Uploaded by

Derrick

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Exercise 2 : Basic Statistics

Essential Libraries
Let us begin by importing the essential Python Libraries.

NumPy : Library for Numeric Computations in Python

Pandas : Library for Data Acquisition and Preparation
Matplotlib : Low-level library for Data Visualization
Seaborn : Higher-level library for Data Visualization

# Basic Libraries
import numpy as np
import pandas as pd
import seaborn as sb
import [Link] as plt # we only need pyplot
[Link]() # set the default Seaborn style for graphics

Problem 1 : Data Preparation

Dataset from Kaggle : The "House Prices" competition
Source: [Link]

The dataset is [Link]; hence we use the read_csv function from Pandas.
Immediately after importing, take a quick look at the data using the head function.

houseData = pd.read_csv('[Link]')
[Link]()

Id MSSubClass MSZoning LotFrontage LotArea Street Alley LotShape

\
0 1 60 RL 65.0 8450 Pave NaN Reg

1 2 20 RL 80.0 9600 Pave NaN Reg

2 3 60 RL 68.0 11250 Pave NaN IR1

3 4 70 RL 60.0 9550 Pave NaN IR1

4 5 60 RL 84.0 14260 Pave NaN IR1

LandContour Utilities ... PoolArea PoolQC Fence MiscFeature MiscVal

MoSold \
0 Lvl AllPub ... 0 NaN NaN NaN 0
2
1 Lvl AllPub ... 0 NaN NaN NaN 0
5
2 Lvl AllPub ... 0 NaN NaN NaN 0
9
3 Lvl AllPub ... 0 NaN NaN NaN 0
2
4 Lvl AllPub ... 0 NaN NaN NaN 0
12

YrSold SaleType SaleCondition SalePrice

0 2008 WD Normal 208500
1 2007 WD Normal 181500
2 2008 WD Normal 223500
3 2006 WD Abnorml 140000
4 2008 WD Normal 250000

[5 rows x 81 columns]

You may get information about the data types using dtypes.

[Link]

Id int64
MSSubClass int64
MSZoning object
LotFrontage float64
LotArea int64
...
MoSold int64
YrSold int64
SaleType object
SaleCondition object
SalePrice int64
Length: 81, dtype: object

You may also get more information about the dataset using info().

[Link]()

<class '[Link]'>
RangeIndex: 1460 entries, 0 to 1459
Data columns (total 81 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Id 1460 non-null int64
1 MSSubClass 1460 non-null int64
2 MSZoning 1460 non-null object
3 LotFrontage 1201 non-null float64
4 LotArea 1460 non-null int64
5 Street 1460 non-null object
6 Alley 91 non-null object
7 LotShape 1460 non-null object
8 LandContour 1460 non-null object
9 Utilities 1460 non-null object
10 LotConfig 1460 non-null object
11 LandSlope 1460 non-null object
12 Neighborhood 1460 non-null object
13 Condition1 1460 non-null object
14 Condition2 1460 non-null object
15 BldgType 1460 non-null object
16 HouseStyle 1460 non-null object
17 OverallQual 1460 non-null int64
18 OverallCond 1460 non-null int64
19 YearBuilt 1460 non-null int64
20 YearRemodAdd 1460 non-null int64
21 RoofStyle 1460 non-null object
22 RoofMatl 1460 non-null object
23 Exterior1st 1460 non-null object
24 Exterior2nd 1460 non-null object
25 MasVnrType 1452 non-null object
26 MasVnrArea 1452 non-null float64
27 ExterQual 1460 non-null object
28 ExterCond 1460 non-null object
29 Foundation 1460 non-null object
30 BsmtQual 1423 non-null object
31 BsmtCond 1423 non-null object
32 BsmtExposure 1422 non-null object
33 BsmtFinType1 1423 non-null object
34 BsmtFinSF1 1460 non-null int64
35 BsmtFinType2 1422 non-null object
36 BsmtFinSF2 1460 non-null int64
37 BsmtUnfSF 1460 non-null int64
38 TotalBsmtSF 1460 non-null int64
39 Heating 1460 non-null object
40 HeatingQC 1460 non-null object
41 CentralAir 1460 non-null object
42 Electrical 1459 non-null object
43 1stFlrSF 1460 non-null int64
44 2ndFlrSF 1460 non-null int64
45 LowQualFinSF 1460 non-null int64
46 GrLivArea 1460 non-null int64
47 BsmtFullBath 1460 non-null int64
48 BsmtHalfBath 1460 non-null int64
49 FullBath 1460 non-null int64
50 HalfBath 1460 non-null int64
51 BedroomAbvGr 1460 non-null int64
52 KitchenAbvGr 1460 non-null int64
53 KitchenQual 1460 non-null object
54 TotRmsAbvGrd 1460 non-null int64
55 Functional 1460 non-null object
56 Fireplaces 1460 non-null int64
57 FireplaceQu 770 non-null object
58 GarageType 1379 non-null object
59 GarageYrBlt 1379 non-null float64
60 GarageFinish 1379 non-null object
61 GarageCars 1460 non-null int64
62 GarageArea 1460 non-null int64
63 GarageQual 1379 non-null object
64 GarageCond 1379 non-null object
65 PavedDrive 1460 non-null object
66 WoodDeckSF 1460 non-null int64
67 OpenPorchSF 1460 non-null int64
68 EnclosedPorch 1460 non-null int64
69 3SsnPorch 1460 non-null int64
70 ScreenPorch 1460 non-null int64
71 PoolArea 1460 non-null int64
72 PoolQC 7 non-null object
73 Fence 281 non-null object
74 MiscFeature 54 non-null object
75 MiscVal 1460 non-null int64
76 MoSold 1460 non-null int64
77 YrSold 1460 non-null int64
78 SaleType 1460 non-null object
79 SaleCondition 1460 non-null object
80 SalePrice 1460 non-null int64
dtypes: float64(3), int64(35), object(43)
memory usage: 924.0+ KB

Note that there are 35 int64 and 3 float64 variables in the dataset.
Extract the 38 variables by filtering the variables using their dtypes.

houseDataNum = [Link][:, [Link] == np.int64]

print("Data dims : ", [Link])
[Link]() # note that all variables are now int64

Data dims : (1460, 35)

<class '[Link]'>
RangeIndex: 1460 entries, 0 to 1459
Data columns (total 35 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Id 1460 non-null int64
1 MSSubClass 1460 non-null int64
2 LotArea 1460 non-null int64
3 OverallQual 1460 non-null int64
4 OverallCond 1460 non-null int64
5 YearBuilt 1460 non-null int64
6 YearRemodAdd 1460 non-null int64
7 BsmtFinSF1 1460 non-null int64
8 BsmtFinSF2 1460 non-null int64
9 BsmtUnfSF 1460 non-null int64
10 TotalBsmtSF 1460 non-null int64
11 1stFlrSF 1460 non-null int64
12 2ndFlrSF 1460 non-null int64
13 LowQualFinSF 1460 non-null int64
14 GrLivArea 1460 non-null int64
15 BsmtFullBath 1460 non-null int64
16 BsmtHalfBath 1460 non-null int64
17 FullBath 1460 non-null int64
18 HalfBath 1460 non-null int64
19 BedroomAbvGr 1460 non-null int64
20 KitchenAbvGr 1460 non-null int64
21 TotRmsAbvGrd 1460 non-null int64
22 Fireplaces 1460 non-null int64
23 GarageCars 1460 non-null int64
24 GarageArea 1460 non-null int64
25 WoodDeckSF 1460 non-null int64
26 OpenPorchSF 1460 non-null int64
27 EnclosedPorch 1460 non-null int64
28 3SsnPorch 1460 non-null int64
29 ScreenPorch 1460 non-null int64
30 PoolArea 1460 non-null int64
31 MiscVal 1460 non-null int64
32 MoSold 1460 non-null int64
33 YrSold 1460 non-null int64
34 SalePrice 1460 non-null int64
dtypes: int64(35)
memory usage: 399.3 KB

That was very Pythonic way of implementing the dtypes filter.

There is a much cleaner way of doing it in Pandas, as follows.

houseDataNum = houseData.select_dtypes(include = np.int64)

print("Data dims : ", [Link])
[Link]() # note that all variables are now int64

Data dims : (1460, 35)

Read data_description.txt (from the Kaggle data folder) to identify the actual Numeric
variables.
Note that this table is created manually, and this is my interpretation. Feel free to choose your
own.

Variable Observation
Id Numeric, but simply an index
MSSubClass Categorial, numeric encoding
LotArea Numeric Variable
OverallQual Categorial : Ordinal 1-to-10
Variable Observation
OverallCond Categorial : Ordinal 1-to-10
YearBuilt Time Stamp, not just numeric
YearRemodAdd Time Stamp, not just numeric
BsmtFinSF1 Numeric Variable
BsmtFinSF2 Numeric Variable
BsmtUnfSF Numeric Variable
TotalBsmtSF Numeric Variable
1stFlrSF Numeric Variable
2ndFlrSF Numeric Variable
LowQualFinSF Numeric Variable
GrLivArea Numeric Variable
BsmtFullBath Numeric Variable
BsmtHalfBath Numeric Variable
FullBath Numeric Variable
HalfBath Numeric Variable
BedroomAbvGr Numeric Variable
KitchenAbvGr Numeric Variable
TotRmsAbvGrd Numeric Variable
Fireplaces Numeric Variable
GarageCars Numeric Variable
GarageArea Numeric Variable
WoodDeckSF Numeric Variable
OpenPorchSF Numeric Variable
EnclosedPorc Numeric Variable
3SsnPorch Numeric Variable
ScreenPorch Numeric Variable
PoolArea Numeric Variable
MiscVal Numeric Variable
MoSold Time Stamp, not just numeric
YrSold Time Stamp, not just numeric
SalePrice Numeric Variable

Drop the non-Numeric variables (axis = 1) from the DataFrame to obtain a pure Numeric
DataFrame. Keeping Id for records.

houseDataNum =
[Link](['MSSubClass','OverallQual','OverallCond','YearBuilt
','YearRemodAdd','MoSold','YrSold'], axis = 1)
[Link]()

<class '[Link]'>
RangeIndex: 1460 entries, 0 to 1459
Data columns (total 28 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Id 1460 non-null int64
1 LotArea 1460 non-null int64
2 BsmtFinSF1 1460 non-null int64
3 BsmtFinSF2 1460 non-null int64
4 BsmtUnfSF 1460 non-null int64
5 TotalBsmtSF 1460 non-null int64
6 1stFlrSF 1460 non-null int64
7 2ndFlrSF 1460 non-null int64
8 LowQualFinSF 1460 non-null int64
9 GrLivArea 1460 non-null int64
10 BsmtFullBath 1460 non-null int64
11 BsmtHalfBath 1460 non-null int64
12 FullBath 1460 non-null int64
13 HalfBath 1460 non-null int64
14 BedroomAbvGr 1460 non-null int64
15 KitchenAbvGr 1460 non-null int64
16 TotRmsAbvGrd 1460 non-null int64
17 Fireplaces 1460 non-null int64
18 GarageCars 1460 non-null int64
19 GarageArea 1460 non-null int64
20 WoodDeckSF 1460 non-null int64
21 OpenPorchSF 1460 non-null int64
22 EnclosedPorch 1460 non-null int64
23 3SsnPorch 1460 non-null int64
24 ScreenPorch 1460 non-null int64
25 PoolArea 1460 non-null int64
26 MiscVal 1460 non-null int64
27 SalePrice 1460 non-null int64
dtypes: int64(28)
memory usage: 319.5 KB

Problem 2 : Statistical Summary

Extract just one variable, SalePrice, from the DataFrame.

saleprice = [Link](houseDataNum['SalePrice'])
print("Data type : ", type(saleprice))
print("Data dims : ", [Link])
[Link]()

Data type : <class '[Link]'>

Data dims : 1460

SalePrice
0 208500
1 181500
2 223500
3 140000
4 250000

Summary Statistics of saleprice, followed by Statistical Visualizations on the variable.

[Link]()

SalePrice
count 1460.000000
mean 180921.195890
std 79442.502883
min 34900.000000
25% 129975.000000
50% 163000.000000
75% 214000.000000
max 755000.000000

f = [Link](figsize=(24, 4))
[Link](data=saleprice, orient = "h", color = "cornflowerblue")

<AxesSubplot:>

f = [Link](figsize=(24, 12))
[Link](data=saleprice, x = "SalePrice", color = "royalblue")

<AxesSubplot:xlabel='SalePrice', ylabel='Count'>
f = [Link](figsize=(24, 12))
[Link](data=saleprice, orient='h')

<AxesSubplot:>

Summary Statistics of LotArea, followed by Statistical Visualizations on the variable.

lotarea = [Link](houseDataNum['LotArea'])
print("Data type : ", type(lotarea))
print("Data dims : ", [Link])
[Link]()

Data type : <class '[Link]'>

Data dims : 1460

LotArea
0 8450
1 9600
2 11250
3 9550
4 14260

[Link]()

LotArea
count 1460.000000
mean 10516.828082
std 9981.264932
min 1300.000000
25% 7553.500000
50% 9478.500000
75% 11601.500000
max 215245.000000

f = [Link](figsize=(24, 4))
[Link](data=lotarea, orient = "h")

<AxesSubplot:>

f = [Link](figsize=(24, 12))
[Link](data=lotarea, x = "LotArea", color = "brown")

<AxesSubplot:xlabel='LotArea', ylabel='Count'>
f = [Link](figsize=(24, 12))
[Link](data=lotarea, orient='h')

<AxesSubplot:>

Extract two variables from the DataFrame -- SalePrice and LotArea -- and check their
mutual relationship.
saleprice = [Link](houseDataNum['SalePrice'])
lotarea = [Link](houseDataNum['LotArea'])

# Set up matplotlib figure with three subplots

f, axes = [Link](2, 3, figsize=(24, 12))

# Plot the basic uni-variate figures for HP

[Link](data=saleprice, orient = "h", ax = axes[0,0])
[Link](data=saleprice, ax = axes[0,1])
[Link](data=saleprice, ax = axes[0,2])

# Plot the basic uni-variate figures for Attack

[Link](data=lotarea, orient = "h", ax = axes[1,0])
[Link](data=lotarea, ax = axes[1,1])
[Link](data=lotarea, ax = axes[1,2])

<AxesSubplot:>

jointDF = [Link]([lotarea, saleprice], axis =

1).reindex([Link])
[Link](data=jointDF, x = 'LotArea', y = 'SalePrice', height =
16)

<[Link] at 0x1cd4be7f5e0>
# Calculate the correlation between the two columns/variables
[Link]()

LotArea SalePrice
LotArea 1.000000 0.263843
SalePrice 0.263843 1.000000

[Link]([Link](), vmin = -1, vmax = 1, annot = True,

fmt=".2f")

<AxesSubplot:>

Python Libraries for House Prices Data
No ratings yet
Python Libraries for House Prices Data
84 pages
Ex 1
No ratings yet
Ex 1
119 pages
House Price Prediction Analysis
No ratings yet
House Price Prediction Analysis
14 pages
Linear Regression - House Price Prediction
100% (2)
Linear Regression - House Price Prediction
174 pages
Regression Workbook
No ratings yet
Regression Workbook
2 pages
ML Beginners: Predict House Prices
No ratings yet
ML Beginners: Predict House Prices
32 pages
Deep Learning - House Price Prediction
No ratings yet
Deep Learning - House Price Prediction
17 pages
Exercise3 Solution
No ratings yet
Exercise3 Solution
19 pages
House Price Prediction Model Guide
No ratings yet
House Price Prediction Model Guide
187 pages
Eda Project
No ratings yet
Eda Project
28 pages
House Price Prediction Analysis
No ratings yet
House Price Prediction Analysis
18 pages
Real Estate Data Insights
No ratings yet
Real Estate Data Insights
7 pages
Kaggle House Prices Advanced Regression Techniques
No ratings yet
Kaggle House Prices Advanced Regression Techniques
87 pages
Pract1.printdsbdapdf 2
No ratings yet
Pract1.printdsbdapdf 2
7 pages
Data Science: Housing Price Prediction
No ratings yet
Data Science: Housing Price Prediction
2 pages
EDA Techniques for Data Science Students
No ratings yet
EDA Techniques for Data Science Students
48 pages
Eda On Housing Data
No ratings yet
Eda On Housing Data
7 pages
Housing Data Analysis Overview
No ratings yet
Housing Data Analysis Overview
2 pages
Data Definition
No ratings yet
Data Definition
14 pages
Copy - of - Descriptive - EDA - Munjal - Exercise1.ipynb - Colaboratory
No ratings yet
Copy - of - Descriptive - EDA - Munjal - Exercise1.ipynb - Colaboratory
30 pages
BCA 5th Sem Lab (ML)
No ratings yet
BCA 5th Sem Lab (ML)
20 pages
ADS Exp3
No ratings yet
ADS Exp3
8 pages
NAME
No ratings yet
NAME
11 pages
Exercise6 Solution
No ratings yet
Exercise6 Solution
8 pages
00 Data Wrangling
No ratings yet
00 Data Wrangling
10 pages
Prepared by Asif Bhat Exploratory Data Analysis: Explore Dataset
No ratings yet
Prepared by Asif Bhat Exploratory Data Analysis: Explore Dataset
143 pages
Assignement 4
No ratings yet
Assignement 4
6 pages
House Price Prediction Guide
No ratings yet
House Price Prediction Guide
14 pages
Data Cleaning On Melbourne Housing
No ratings yet
Data Cleaning On Melbourne Housing
16 pages
Quantam - Learning - Colaboratory
No ratings yet
Quantam - Learning - Colaboratory
13 pages
Ds ML House Price Book
No ratings yet
Ds ML House Price Book
46 pages
Predicting Home Prices in Bangalore
No ratings yet
Predicting Home Prices in Bangalore
18 pages
Intro to ML with Sklearn & Python
No ratings yet
Intro to ML with Sklearn & Python
10 pages
Pract1.printdsbdapdf 2
No ratings yet
Pract1.printdsbdapdf 2
10 pages
Exp 10
No ratings yet
Exp 10
1 page
Setup: Chapter 2 - End-To-End Machine Learning Project
No ratings yet
Setup: Chapter 2 - End-To-End Machine Learning Project
31 pages
Capstone Project 6 April
No ratings yet
Capstone Project 6 April
64 pages
King County House Price Analysis
No ratings yet
King County House Price Analysis
1 page
Housing Data Analysis with Python
No ratings yet
Housing Data Analysis with Python
26 pages
Real Estate Price Prediction Guide
No ratings yet
Real Estate Price Prediction Guide
13 pages
King County House Sales Data Analysis
No ratings yet
King County House Sales Data Analysis
11 pages
Boston Housing Solutions
No ratings yet
Boston Housing Solutions
3 pages
House Price Prediction Models
No ratings yet
House Price Prediction Models
16 pages
Intro to Pandas for Data Science
No ratings yet
Intro to Pandas for Data Science
6 pages
Tarea - Prediccion de Casas en California
No ratings yet
Tarea - Prediccion de Casas en California
5 pages
Real Estate Price Prediction Model
No ratings yet
Real Estate Price Prediction Model
33 pages
Delhi Housing Price Analysis Model
No ratings yet
Delhi Housing Price Analysis Model
151 pages
(House Price Prediction) Capstone Project For Python
No ratings yet
(House Price Prediction) Capstone Project For Python
10 pages
TAREA TERCER PARCIAL - Jupyter Notebook
No ratings yet
TAREA TERCER PARCIAL - Jupyter Notebook
15 pages
Normialization Dataset
No ratings yet
Normialization Dataset
7 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
24 pages
Kaggle House Price Prediction Analysis
No ratings yet
Kaggle House Price Prediction Analysis
73 pages
Wa0025.
No ratings yet
Wa0025.
19 pages
Housing Data Cleaning & Analysis
No ratings yet
Housing Data Cleaning & Analysis
7 pages
Pandas Assignment 1
No ratings yet
Pandas Assignment 1
7 pages
002 Python Pandas
No ratings yet
002 Python Pandas
19 pages
Minor Assignment
No ratings yet
Minor Assignment
34 pages
Customer Segmentation Analysis
No ratings yet
Customer Segmentation Analysis
34 pages
Blog - Arsalan Dehghani's Oracle Blog
No ratings yet
Blog - Arsalan Dehghani's Oracle Blog
18 pages
Memory System Overview and Technologies
No ratings yet
Memory System Overview and Technologies
44 pages
Big Data Analytics
No ratings yet
Big Data Analytics
1 page
Introduction to Geographic Information Systems
No ratings yet
Introduction to Geographic Information Systems
2 pages
Chapter1 5
No ratings yet
Chapter1 5
71 pages
Lab#05
No ratings yet
Lab#05
2 pages
PDF Makroekonomi Mankiw Edisi 6 PDF 12 Compress
No ratings yet
PDF Makroekonomi Mankiw Edisi 6 PDF 12 Compress
3 pages
Gujarat Technological University
No ratings yet
Gujarat Technological University
2 pages
Business Planning Process
100% (3)
Business Planning Process
53 pages
Symfony CLI Cheat Sheet
No ratings yet
Symfony CLI Cheat Sheet
4 pages
Unit 1
No ratings yet
Unit 1
106 pages
How To Write MYP Science Lab Reports
100% (2)
How To Write MYP Science Lab Reports
29 pages
Linked Stack Implementation in C
No ratings yet
Linked Stack Implementation in C
16 pages
ArcGIS Shapefile Files Types & Extensions
No ratings yet
ArcGIS Shapefile Files Types & Extensions
4 pages
How Do I Copy An Oracle DB From One Server To Another
No ratings yet
How Do I Copy An Oracle DB From One Server To Another
2 pages
An Integrated Dataset of Spatiotemporal and Event
No ratings yet
An Integrated Dataset of Spatiotemporal and Event
12 pages
Mathan's Resume
No ratings yet
Mathan's Resume
1 page
Dice Resume CV SAI KARTHIK
No ratings yet
Dice Resume CV SAI KARTHIK
4 pages
Title of The Study
No ratings yet
Title of The Study
5 pages
A Detailed Roadmap To Become A Business Analyst
100% (1)
A Detailed Roadmap To Become A Business Analyst
11 pages
10g Installation Steps On LINUX
No ratings yet
10g Installation Steps On LINUX
6 pages
PACT Analysis in Interaction Design
No ratings yet
PACT Analysis in Interaction Design
20 pages
Paperless Research Output Strategy
No ratings yet
Paperless Research Output Strategy
16 pages
Certified List of Candidates: Region Xiii Agusan Del Sur Provincial Governor
No ratings yet
Certified List of Candidates: Region Xiii Agusan Del Sur Provincial Governor
21 pages
DSA - Week 3 - Linked List
No ratings yet
DSA - Week 3 - Linked List
32 pages
VT 2009 0102 07 Eng M.a.olabuenaga Ornes
No ratings yet
VT 2009 0102 07 Eng M.a.olabuenaga Ornes
21 pages
Indusoft Modbus PDF
No ratings yet
Indusoft Modbus PDF
25 pages
Attribute Data II
No ratings yet
Attribute Data II
6 pages
MLK Chapter 7 1 Handout
No ratings yet
MLK Chapter 7 1 Handout
11 pages
Understanding Quantitative Research Objectives
No ratings yet
Understanding Quantitative Research Objectives
17 pages

Exercise2 Solution

Uploaded by

Exercise2 Solution

Uploaded by

Exercise 2 : Basic Statistics

NumPy : Library for Numeric Computations in Python

Problem 1 : Data Preparation

Id MSSubClass MSZoning LotFrontage LotArea Street Alley LotShape

1 2 20 RL 80.0 9600 Pave NaN Reg

2 3 60 RL 68.0 11250 Pave NaN IR1

3 4 70 RL 60.0 9550 Pave NaN IR1

4 5 60 RL 84.0 14260 Pave NaN IR1

LandContour Utilities ... PoolArea PoolQC Fence MiscFeature MiscVal

YrSold SaleType SaleCondition SalePrice

houseDataNum = [Link][:, [Link] == np.int64]

Data dims : (1460, 35)

That was very Pythonic way of implementing the dtypes filter.

houseDataNum = houseData.select_dtypes(include = np.int64)

Data dims : (1460, 35)

Problem 2 : Statistical Summary

Data type : <class '[Link]'>

Summary Statistics of saleprice, followed by Statistical Visualizations on the variable.

Summary Statistics of LotArea, followed by Statistical Visualizations on the variable.

Data type : <class '[Link]'>

# Set up matplotlib figure with three subplots

# Plot the basic uni-variate figures for HP

# Plot the basic uni-variate figures for Attack

jointDF = [Link]([lotarea, saleprice], axis =

[Link]([Link](), vmin = -1, vmax = 1, annot = True,

You might also like