0% found this document useful (0 votes)

101 views29 pages

Assignment CSE-520

This document contains an assignment submission for a statistics for data science course. The assignment addresses identifying qualitative and quantitative variables, appropriate graphical representations for qualitative variables, creating a histogram and commenting on it for a quantitative variable (mpg), and creating and commenting on stem-leaf plots for all numerical variables. Pie charts are used to represent engine type and transmission type variables. A histogram is used to represent the mpg variable, showing it is right-skewed and multi-modal. Stem-leaf plots are created and commented on for mpg and cylinders variables to examine spread, skewness, outliers, and modality.

Uploaded by

Shafat91

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

101 views29 pages

Assignment CSE-520

Uploaded by

Shafat91

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 29

Statistics for Data Science

Course Code: CSE-520

Assignment - 01

Submitted to: Dr. Md. Sohel Rana

Associate Professor
Department of Mathematical and Physical Sciences
East West University

Submitted by: Tarik Adnan

Language used: Python 3.8
ID no: 2019-03-96-003
Department of CSE
Date of submission: 20 August, 2020
Q1. Identify the qualitative and quantitative variables.

Answer:

Code Snippet:
import pandas as pd

# Read the CSV file

data = pd.read_csv('F:/File/Car_data.csv')
print(data)

Output:
C:\Python38\python.exe "C:/Users/Tarik Adnan/PycharmProjects/CSE520/read_csv.py"

Car Model mpg cyl disp hp ... qsec vs am gear carb

0 Mazda RX4 21.0 6 160.0 110 ... 16.46 0 1 4 4
1 Mazda RX4 Wag 21.0 6 160.0 110 ... 17.02 0 1 4 4
2 Datsun 710 22.8 4 108.0 93 ... 18.61 1 1 4 1
3 Hornet 4 Drive 21.4 6 258.0 110 ... 19.44 1 0 3 1
4 Hornet Sportabout 18.7 8 360.0 175 ... 17.02 0 0 3 2
5 Valiant 18.1 6 225.0 105 ... 20.22 1 0 3 1
6 Duster 360 14.3 8 360.0 245 ... 15.84 0 0 3 4
7 Merc 240D 24.4 4 146.7 62 ... 20.00 1 0 4 2
8 Merc 230 22.8 4 140.8 95 ... 22.90 1 0 4 2
9 Merc 280 19.2 6 167.6 123 ... 18.30 1 0 4 4
10 Merc 280C 17.8 6 167.6 123 ... 18.90 1 0 4 4
11 Merc 450SE 16.4 8 275.8 180 ... 17.40 0 0 3 3
12 Merc 450SL 17.3 8 275.8 180 ... 17.60 0 0 3 3
13 Merc 450SLC 15.2 8 275.8 180 ... 18.00 0 0 3 3
14 Cadillac Fleetwood 10.4 8 472.0 205 ... 17.98 0 0 3 4
15 Lincoln Continental 10.4 8 460.0 215 ... 17.82 0 0 3 4
16 Chrysler Imperial 14.7 8 440.0 230 ... 17.42 0 0 3 4
17 Fiat 128 32.4 4 78.7 66 ... 19.47 1 1 4 1
18 Honda Civic 30.4 4 75.7 52 ... 18.52 1 1 4 2
19 Toyota Corolla 33.9 4 71.1 65 ... 19.90 1 1 4 1
20 Toyota Corona 21.5 4 120.1 97 ... 20.01 1 0 3 1
21 Dodge Challenger 15.5 8 318.0 150 ... 16.87 0 0 3 2
22 AMC Javelin 15.2 8 304.0 150 ... 17.30 0 0 3 2
23 Camaro Z28 13.3 8 350.0 245 ... 15.41 0 0 3 4
24 Pontiac Firebird 19.2 8 400.0 175 ... 17.05 0 0 3 2
25 Fiat X1-9 27.3 4 79.0 66 ... 18.90 1 1 4 1
26 Porsche 914-2 26.0 4 120.3 91 ... 16.70 0 1 5 2
27 Lotus Europa 30.4 4 95.1 113 ... 16.90 1 1 5 2
28 Ford Pantera L 15.8 8 351.0 264 ... 14.50 0 1 5 4
29 Ferrari Dino 19.7 6 145.0 175 ... 15.50 0 1 5 6
30 Maserati Bora 15.0 8 301.0 335 ... 14.60 0 1 5 8
31 Volvo 142E 21.4 4 121.0 109 ... 18.60 1 1 4 2

[32 rows x 12 columns]

From the data, we can find out the Qualitative and Quantitative variables.
Quantitative variables:
1. mpg (Miles/(US) gallon)
2. cyl (Number of cylinders)
3. disp (Displacement (cu.in.))
4. hp (Gross horsepower)
5. drat (Rear axle ratio)
6. wt (Weight (1000 lbs)
7. qsec (1/4 mile time)
8. gear (Number of forward gears)
9. carb (Number of carburetors)

Qualitative variables:
1. vs (Engine (0 = V-shaped, 1 = straight))
2. am (Transmission (0 = automatic, 1 = manual))

Q2. Give the appropriate graphical representation for qualitative variables. Comment the
graphs.
Answer:

a. Graphical representation for the variable: “vs (Engine (0 = V-shaped, 1 = straight))”

Code Snippet:
import matplotlib.pyplot as plt
import pandas as pd

fig, ax = plt.subplots(figsize=(10, 6), subplot_kw=dict(aspect="equal"))

# Read CSV
data = pd.read_csv('F:/File/Car_data.csv').vs
ingredients= ['V -Shaped', 'Straight']
data.value_counts().plot.pie(autopct='%1.2f%%', startangle=90,
counterclock=True)

ax.set_title("A Pie Representation of Engine Type (V -Shaped/Straight)

Distribution")
ax.legend(ingredients,
title="Engine Type: ",
loc="center left",
bbox_to_anchor=(1, 0, 0.5, 1))

plt.xlabel('Feature Identifier: ‘vs’')

plt.ylabel('')
plt.show()
Output:

Comment on the plot:

From the above representation it is visible that, 56% of the total cars from the data set is of V –
Shaped engine type and 44% of the total cars from the data set is of Straight engine type.
b. Graphical representation for the variable: “am (Transmission (0 = automatic, 1 =
manual))”
Code Snippet:
import matplotlib.pyplot as plt
import pandas as pd

fig, ax = plt.subplots(figsize=(7, 4), subplot_kw=dict(aspect="equal"))

# Read CSV data

data = pd.read_csv('F:/File/Car_data.csv').am
ingredients = ['Automatic', 'Manual']
data.value_counts().plot.pie(autopct='%1.1f%%', startangle=90,
counterclock=True)

ax.set_title("A Pie Representation of Transmission Type (Automatic/Manual)

Distribution")
ax.legend(ingredients,
title="Transmission Type: ",
loc="center left",
bbox_to_anchor=(1, 0, 0.5, 1))

plt.ylabel('')
plt.xlabel('Feature Identifier: ‘am’')
plt.show()

Output:
Comment on the plot:
From the above representation it is visible that, ~60% of the total cars from the data set is of
Automatic transmission type and ~40% of the total cars from the data set is of Manual
transmission type.
Q3. Give a histogram for the variable ‘mpg (Miles/(US) gallon)’ and comment the plot.
Answer:

Code Snippet:

import pandas as pd
import matplotlib.pyplot as plt
fig = plt.figure(figsize=(6, 4))
data = pd.read_csv('F:/File/Car_data.csv', quoting=3)['mpg']
histogram = plt.hist(data, width=2.2, color='#0504aa', alpha=0.7)
plt.xlim([8, 35])
plt.grid(axis='y', alpha=0.75)
plt.ylabel('Frequency')
plt.xlabel('‘mpg (Miles/(US) gallon)’')
plt.title('Normal Distribution Histogram of Feature: ‘mpg (Miles/(US)
gallon)’')
plt.show()

Output:
Comment on the plot:
 The histogram shows that the majority of the cars have a mileage between 12-24
Miles/gallon
 This is a unimodal histogram which has a hump in between 15-20
 This is not a symmetrical histogram
 The upper tail is longer than the lower tail, so it is positively skewed.

Q4. Give stem-leaf-plot for all numerical variables and comment the plots.
Answer:
1. Stem –leaf plot for feature ‘mpg (Miles/(US) gallon)’
Code Snippet:
import pandas as pd
import stemgraphic
import matplotlib.pyplot as plt
col_name = input("Type the feature name to plot stem-leaf: ")
input_scale = float(input("Input scale to plot stem-leaf: "))

data = pd.read_csv('F:/File/Car_data.csv')[col_name]
y = pd.Series(data)
fig, ax = stemgraphic.stem_graphic(data, scale=input_scale)
fig.set_figheight(8)
fig.set_figwidth(6)

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

plt.show()

Input:

Type the feature name to plot stem-leaf: mpg

Input scale to plot stem-leaf: 1
Output:
Comment on the plot:
 Spread
The spread shows how much the data vary. The following stem-and-leaf plot shows
‘mpg (Miles/(US) gallon)’ usage of the cars. The values range from 10.4 to 33.9.

 Skewed data

When data are skewed, the majority of the data are located on the high or low side
of the graph. Skewness indicates that the data may not be normally
distributed. Often, skewness is easiest to detect with a histogram or a boxplot.

The following stem-and-leaf plot is right skewed.

 Outliers

Outliers, which are data values that are far away from other data values, can strongly
affect the results. Often, outliers are easiest to identify on a boxplot. On a stem-and-
leaf plot, isolated values at the ends identify possible outliers.
The above stem-leaf plot of feature ‘mpg’ it is not indicating any outliers.

 Multi-modal data

2. Stem –leaf plot for feature ‘cyl (Number of cylinders)’

Code Snippet:
import pandas as pd
import stemgraphic
import matplotlib.pyplot as plt
col_name = input("Type the feature name to plot stem-leaf: ")
input_scale = float(input("Input scale to plot stem-leaf: "))

data = pd.read_csv('F:/File/Car_data.csv')[col_name]
y = pd.Series(data)
fig, ax = stemgraphic.stem_graphic(data, scale=input_scale)
fig.set_figheight(8)
fig.set_figwidth(6)

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

plt.show()
Input:

Type the feature name to plot stem-leaf: cyl

Input scale to plot stem-leaf: 1

Output:
Comment on the plot:
 Spread
The spread shows how much the data vary. The following stem-and-leaf plot shows
‘cyl (Number of cylinders)’ of the cars. The values range from 4 to 8.
 Skewed data

The following stem-and-leaf plot is not skewed.

 Outliers

 Multi-modal data

3. Stem – leaf plot for feature ‘disp (Displacement (cu.in.))’

data = pd.read_csv('F:/File/Car_data.csv')[col_name]
y = pd.Series(data)
fig, ax = stemgraphic.stem_graphic(data, scale=input_scale)
fig.set_figheight(8)
fig.set_figwidth(6)

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

plt.show()
Input:

Type the feature name to plot stem-leaf: disp

Input scale to plot stem-leaf: 10

Output:
Comment on the plot:
 Spread
The spread shows how much the data vary. The following stem-and-leaf plot shows
‘disp (Displacement (cu.in.))’ of the cars. The values range from 71.1 to 472.0.
 Skewed data

The following stem-and-leaf plot is right skewed.

 Outliers

 Multi-modal data

4. Stem – leaf plot for feature ‘hp (Gross horsepower)’

data = pd.read_csv('F:/File/Car_data.csv')[col_name]
y = pd.Series(data)
fig, ax = stemgraphic.stem_graphic(data, scale=input_scale)
fig.set_figheight(8)
fig.set_figwidth(6)

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

plt.show()
Input:

Type the feature name to plot stem-leaf: hp

Input scale to plot stem-leaf: 10

Output:
Comment on the plot:
 Spread
The spread shows how much the data vary. The following stem-and-leaf plot shows
‘hp (Gross horsepower)’ of the cars. The values range from 52 to 335.
 Skewed data

The following stem-and-leaf plot is right skewed.

 Outliers

 Multi-modal data

5. Stem – leaf plot for feature ‘drat (Rear axle ratio)’

data = pd.read_csv('F:/File/Car_data.csv')[col_name]
y = pd.Series(data)
fig, ax = stemgraphic.stem_graphic(data, scale=input_scale)
fig.set_figheight(8)
fig.set_figwidth(6)

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

plt.show()
Input:

Type the feature name to plot stem-leaf: drat

Input scale to plot stem-leaf: .1

Output:
Comment on the plot:
 Spread
The spread shows how much the data vary. The following stem-and-leaf plot shows
‘drat (Rear axle ratio)’ of the cars. The values range from 2.76 to 4.93.
 Skewed data

The following stem-and-leaf plot is right skewed.

 Outliers

 Multi-modal data

6. Stem – leaf plot for feature ‘wt (Weight (1000 lbs))’

data = pd.read_csv('F:/File/Car_data.csv')[col_name]
y = pd.Series(data)
fig, ax = stemgraphic.stem_graphic(data, scale=input_scale)
fig.set_figheight(8)
fig.set_figwidth(6)

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

plt.show()
Input:

Type the feature name to plot stem-leaf: wt

Input scale to plot stem-leaf: 1

Output:

Comment on the plot:

 Spread
The spread shows how much the data vary. The following stem-and-leaf plot shows
‘drat (Rear axle ratio)’ of the cars. The values range from 1.513 to 5.424.
 Skewed data

The following stem-and-leaf plot is right skewed.

 Outliers

 Multi-modal data

7. Stem – leaf plot for feature ‘qsec (1/4 mile time)’

data = pd.read_csv('F:/File/Car_data.csv')[col_name]
y = pd.Series(data)
fig, ax = stemgraphic.stem_graphic(data, scale=input_scale)
fig.set_figheight(8)
fig.set_figwidth(6)

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

plt.show()

Input:

Type the feature name to plot stem-leaf: qsec

Input scale to plot stem-leaf: 1
Output:

Comment on the plot:

 Spread
The spread shows how much the data vary. The following stem-and-leaf plot shows
‘qsec (1/4 mile time)’ of the cars. The values range from 14.5 to 22.9.
 Skewed data

The following stem-and-leaf plot is symmetrical.

 Outliers

 Multi-modal data

8. Stem – leaf plot for feature ‘gear (Number of forward gears) ’

data = pd.read_csv('F:/File/Car_data.csv')[col_name]
y = pd.Series(data)
fig, ax = stemgraphic.stem_graphic(data, scale=input_scale)
fig.set_figheight(8)
fig.set_figwidth(6)

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

plt.show()

Input:

Type the feature name to plot stem-leaf: gear

Input scale to plot stem-leaf: 1
Output:

Comment on the plot:

 Spread
The spread shows how much the data vary. The following stem-and-leaf plot shows
‘gear (Number of forward gears)’ of the cars. The values range from 3 to 5.
 Skewed data

The following stem-and-leaf plot is symmetric.

 Outliers

 Multi-modal data

9. Stem – leaf plot for feature ‘carb (Number of carburetors)’

data = pd.read_csv('F:/File/Car_data.csv')[col_name]
y = pd.Series(data)
fig, ax = stemgraphic.stem_graphic(data, scale=input_scale)
fig.set_figheight(8)
fig.set_figwidth(6)

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

plt.show()

Input:

Type the feature name to plot stem-leaf: carb

Input scale to plot stem-leaf: 1
Output:

Comment on the plot:

 Spread
The spread shows how much the data vary. The following stem-and-leaf plot shows
‘gear (Number of forward gears)’ of the cars. The values range from 1 to 8.
 Skewed data

The following stem-and-leaf plot is right skewed.

 Outliers

 Multi-modal data

Multi-modal data have multiple peaks, also called modes. Multi-modal data often
indicate that important variables are not yet accounted for.
From the above stem-leaf plot it is visible that it has 2 peaks.
Q5. Find the boxplot for the variable ‘wt(Weight (1000 lbs))’. Do you identify outliers from
this data? If so, how will you interpret those car models?
Answer:
Code Snippet:
import pandas
import matplotlib.pyplot as plt
import glob
files = glob.glob('F:/File/*.csv')
fig = plt.figure(figsize=(10, 4))

for file in files:

df = pandas.read_csv(file)
plt.title('Box-Whiskers Plot For The Feature: ‘wt(Weight (1000 lbs))’')
plt.xlim([1, 6])
plt.xlabel('Weight (1000 lbs)')
plt.ylabel('Feature Identifier')
box = df.boxplot(column=['wt'], vert=0, color='b')
output = df.wt.describe()
print(output)
plt.show()

Output:
Output of describe() function:
C:\Python38\python.exe "C:/Users/Tarik
Adnan/PycharmProjects/CSE520/Box_Whiskers_Plot_WT.py"

count 32.000000
mean 3.217250
std 0.978457
min 1.513000
25% 2.581250
50% 3.325000
75% 3.610000
max 5.424000
Name: wt, dtype: float64

Comment on the plot:

 It is visible from the box whiskers plot that there is 3 outliers. The car models that fall
into outlier are listed below
a. Cadillac Fleetwood (Weight (1000 lbs: 5.25)
b. Lincoln Continental (Weight (1000 lbs: 5.424)
c. Chrysler Imperial (Weight (1000 lbs: 5.345)
 The 75% cars have the weight (1000 lbs) in between 3.610
 The mean of the data for the variable “wt” is 3.217250
 The outlier labeled cars are way too heavy in terms of Weight (1000 lbs) from the
upper fence 4.084.
Q6. Which car models have less ‘mpg’ than the first quarter in terms of less ‘mpg’? Also, find
car models that have more ‘mpg’ than the third quarter.
Answer:
Code Snippet:
import pandas
import matplotlib.pyplot as plt
import glob
files = glob.glob('F:/File/*.csv')
fig = plt.figure(figsize=(6, 4))

for file in files:

df = pandas.read_csv(file)
plt.title('Box-Whiskers Plot For The Feature: ‘mpg (Miles/(US) gallon)’')
plt.xlim([0, 40])
plt.xlabel('Miles/(US) gallon')
plt.ylabel('Feature Identifier')
box = df.boxplot(column=['mpg'], vert=0, color='b')
output = df.mpg.describe()
print(output)
plt.show()

Output:
Output of describe() function:
C:\Python38\python.exe "C:\Users\Tarik
Adnan\PycharmProjects\CSE520\Box_Whiskers_Plot_MPG.py"

count 32.000000
mean 20.090625
std 6.026948
min 10.400000
25% 15.425000
50% 19.200000
75% 22.800000
max 33.900000
Name: mpg, dtype: float64

Comment on the plot:

 It is visible from the box whiskers plot that there is 1 outlier. The car model that falls
into outlier is listed below
a. Toyota Corolla (Miles/(US) gallon: 33.9)
 First quarter (25%) in terms of ‘mpg’ is 15.425. There are total 8 Car models that have
less ‘mpg’ than the first quarter in terms of less ‘mpg’. Those are:
a. Duster 360 (Miles/(US) gallon: 14.3)
b. Merc 450SLC (Miles/(US) gallon: 15.2)
c. Cadillac Fleetwood (Miles/(US) gallon: 10.4)
d. Lincoln Continental (Miles/(US) gallon: 10.4)
e. Chrysler Imperial (Miles/(US) gallon: 14.7)
f. AMC Javelin (Miles/(US) gallon: 15.2)
g. Camaro Z28 (Miles/(US) gallon: 13.3)
h. Maserati Bora (Miles/(US) gallon: 15)
 Third quarter (75%) in terms of ‘mpg’ is 22.80. There are total 7 car models that have
more ‘mpg’ than the third quarter. Those are:
a. Merc 240D (Miles/(US) gallon: 24.4)
b. Fiat 128 (Miles/(US) gallon: 32.4)
c. Honda Civic (Miles/(US) gallon: 30.4)
d. Toyota Corolla (Miles/(US) gallon: 33.9)
e. Fiat X1-9 (Miles/(US) gallon: 27.3)
f. Porsche 914-2 (Miles/(US) gallon: 26.0)
g. Lotus Europa (Miles/(US) gallon: 30.4)

Proton Holding Berhads Company Vision, Mission and Objectives
100% (1)
Proton Holding Berhads Company Vision, Mission and Objectives
3 pages
Letter To Deputy Commissioner (Traffic)
No ratings yet
Letter To Deputy Commissioner (Traffic)
2 pages
En 16432-1 - EN - Ballastless Track Systems - General Requirements
50% (2)
En 16432-1 - EN - Ballastless Track Systems - General Requirements
35 pages
Cars Sales Dashboard
No ratings yet
Cars Sales Dashboard
19 pages
Drawings of Major Bridge at 227+649
No ratings yet
Drawings of Major Bridge at 227+649
35 pages
Assignment 01 - STA102 - Sp-21
No ratings yet
Assignment 01 - STA102 - Sp-21
2 pages
Memu Working
50% (2)
Memu Working
3 pages
Seria S323
No ratings yet
Seria S323
550 pages
Lexus Hybrid rx400h 2008
100% (1)
Lexus Hybrid rx400h 2008
31 pages
APWD NH 20-21 (1) - Merged
No ratings yet
APWD NH 20-21 (1) - Merged
111 pages
Oceanjet Ticket
No ratings yet
Oceanjet Ticket
2 pages
Post Test Praktikum8
No ratings yet
Post Test Praktikum8
14 pages
Project Impact of Car Features
No ratings yet
Project Impact of Car Features
9 pages
Data Set For Excersize
No ratings yet
Data Set For Excersize
20 pages
Predicting The Price of A Used Car Using A Regression Model With Python
No ratings yet
Predicting The Price of A Used Car Using A Regression Model With Python
82 pages
41 - Đinh Thị Thùy Linh - 23a4050209.ipynb Colaboratory
No ratings yet
41 - Đinh Thị Thùy Linh - 23a4050209.ipynb Colaboratory
4 pages
Zeon Auto Research
No ratings yet
Zeon Auto Research
4 pages
Pressure Lube Recip Technical Sales Training
No ratings yet
Pressure Lube Recip Technical Sales Training
30 pages
Project - Analyzing The Impact of Car Features On Price and Profitability
No ratings yet
Project - Analyzing The Impact of Car Features On Price and Profitability
8 pages
Bda File
No ratings yet
Bda File
54 pages
Aayushi Bda File
No ratings yet
Aayushi Bda File
41 pages
Assignment
No ratings yet
Assignment
49 pages
Untitled - Ipynb - (5) - JupyterLab
No ratings yet
Untitled - Ipynb - (5) - JupyterLab
4 pages
Week 2 - Assignment: Step 1: What Is The HP (HP Stands For "Horse Power")
No ratings yet
Week 2 - Assignment: Step 1: What Is The HP (HP Stands For "Horse Power")
3 pages
Statistics Introduction
No ratings yet
Statistics Introduction
8 pages
Se Python - Merged
No ratings yet
Se Python - Merged
77 pages
Car Trend Analysis
No ratings yet
Car Trend Analysis
12 pages
Assignment 2 Output 229010
No ratings yet
Assignment 2 Output 229010
17 pages
Project - Analyzing The Impact of Car Features On Price and Profitability
No ratings yet
Project - Analyzing The Impact of Car Features On Price and Profitability
8 pages
Car 13591
No ratings yet
Car 13591
2 pages
Practical 5
No ratings yet
Practical 5
5 pages
'Horsepower' "?" 'Horsepower' 'Horsepower' 'Horsepower' 'Horsepower' 'Horsepower'
No ratings yet
'Horsepower' "?" 'Horsepower' 'Horsepower' 'Horsepower' 'Horsepower' 'Horsepower'
5 pages
Automobile Price Data
No ratings yet
Automobile Price Data
53 pages
Mtcars - Ipynb - Colab
No ratings yet
Mtcars - Ipynb - Colab
2 pages
9587 - 9638 - 9563 - ADS - Exp1.ipynb - Colab
No ratings yet
9587 - 9638 - 9563 - ADS - Exp1.ipynb - Colab
8 pages
Team AN
No ratings yet
Team AN
23 pages
Belarus Car Price Prediction
No ratings yet
Belarus Car Price Prediction
18 pages
Car Price Prediction
No ratings yet
Car Price Prediction
35 pages
Automobil E Data Analysis: Name Pgp-Dsba Online January' 21 Date: Dd/mm/yyyy
No ratings yet
Automobil E Data Analysis: Name Pgp-Dsba Online January' 21 Date: Dd/mm/yyyy
11 pages
Car Price Prediction 1
No ratings yet
Car Price Prediction 1
24 pages
R Lab Ex 1 To 5
No ratings yet
R Lab Ex 1 To 5
26 pages
R Studio
No ratings yet
R Studio
5 pages
Untitled 21
No ratings yet
Untitled 21
6 pages
Course2 - DataAnalysis With Python - Week3 - Exploratory Data Analysis
No ratings yet
Course2 - DataAnalysis With Python - Week3 - Exploratory Data Analysis
23 pages
Car Price Prediction Using ML
No ratings yet
Car Price Prediction Using ML
11 pages
Elite Sports Cars Eda
No ratings yet
Elite Sports Cars Eda
9 pages
Mtcars Data
No ratings yet
Mtcars Data
2 pages
Activity 2
No ratings yet
Activity 2
16 pages
DV Ca-1
No ratings yet
DV Ca-1
9 pages
1AA17AT051 - Bus Terminal Cum Commercial Complex by Sanjay SS PDF
No ratings yet
1AA17AT051 - Bus Terminal Cum Commercial Complex by Sanjay SS PDF
63 pages
R
No ratings yet
R
3 pages
Numpy,,Pandas (24.4.25)
No ratings yet
Numpy,,Pandas (24.4.25)
1 page
An Overview of Statistical Tests in SAS: 1. Introduction and Description of Data
No ratings yet
An Overview of Statistical Tests in SAS: 1. Introduction and Description of Data
8 pages
R Studio
No ratings yet
R Studio
4 pages
4.5.33 Guidelines For Lime Stabilization PDF
100% (1)
4.5.33 Guidelines For Lime Stabilization PDF
4 pages
Lab Assignment 6
No ratings yet
Lab Assignment 6
5 pages
Autos
No ratings yet
Autos
2 pages
Mtcars Dataset Analysis in R
No ratings yet
Mtcars Dataset Analysis in R
4 pages
Barredora RJ350
No ratings yet
Barredora RJ350
158 pages
SMDM Business+Report
No ratings yet
SMDM Business+Report
11 pages
SMDM Business+Report
No ratings yet
SMDM Business+Report
11 pages
Practical NO.3
No ratings yet
Practical NO.3
7 pages
SMDM-Business Report
No ratings yet
SMDM-Business Report
11 pages
ZF 5hp19e Repair Manual
100% (66)
ZF 5hp19e Repair Manual
10 pages
Ticket-Voucher (Elena Christelle)
No ratings yet
Ticket-Voucher (Elena Christelle)
2 pages
SMDM-Business Report
No ratings yet
SMDM-Business Report
11 pages
Data Science Lab
No ratings yet
Data Science Lab
28 pages
Miles Per Gallon
No ratings yet
Miles Per Gallon
11 pages
Statistics Cia 1
No ratings yet
Statistics Cia 1
26 pages
Post: Officer (IT) (1) Sonali Bank Limited (2) Janata Bank Limited
No ratings yet
Post: Officer (IT) (1) Sonali Bank Limited (2) Janata Bank Limited
1 page
Analysis On Car Resale Price
No ratings yet
Analysis On Car Resale Price
13 pages
R.A. 6539
No ratings yet
R.A. 6539
3 pages
Motor Trend Car Road Tests PDF
No ratings yet
Motor Trend Car Road Tests PDF
1 page
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
22 pages
SMDM-Business Report
No ratings yet
SMDM-Business Report
11 pages
Agenda Road Safety Hackathon
No ratings yet
Agenda Road Safety Hackathon
1 page
Selected Outlets
No ratings yet
Selected Outlets
6 pages
Dodge-Durango-1998-USA 4WD Instructions
No ratings yet
Dodge-Durango-1998-USA 4WD Instructions
337 pages
Operacion y Mantenimiento Cargador Doosan dl320 02
No ratings yet
Operacion y Mantenimiento Cargador Doosan dl320 02
11 pages
G05 X5 Hitch Installation Instructions
No ratings yet
G05 X5 Hitch Installation Instructions
11 pages
The Future of Electric Vehicles A Comprehensive Review of Technological Advancements, Market Trends, and Environmental Impacts
No ratings yet
The Future of Electric Vehicles A Comprehensive Review of Technological Advancements, Market Trends, and Environmental Impacts
12 pages
Remove Install Camshaft
No ratings yet
Remove Install Camshaft
4 pages
A PDF
No ratings yet
A PDF
9 pages
ROLL 2 ELS DOC New
No ratings yet
ROLL 2 ELS DOC New
43 pages
Rimini Maps
No ratings yet
Rimini Maps
19 pages
Flare Cars - Flare HR
No ratings yet
Flare Cars - Flare HR
8 pages
Bridge Proposals Pmgsy 2023-24, Batch-1 Status& Geosadak Status
No ratings yet
Bridge Proposals Pmgsy 2023-24, Batch-1 Status& Geosadak Status
15 pages
Listening UN SMA 2005
No ratings yet
Listening UN SMA 2005
2 pages
System Administrator - Co-Operative Societies - Dt. 28.04.2019 Quesstion Code - A
No ratings yet
System Administrator - Co-Operative Societies - Dt. 28.04.2019 Quesstion Code - A
3 pages
Ford Focus Diesel Manual 152 153
No ratings yet
Ford Focus Diesel Manual 152 153
2 pages
Automotive and Small Engine Tools Assessment For CO
No ratings yet
Automotive and Small Engine Tools Assessment For CO
2 pages
Img 20211021 0002
No ratings yet
Img 20211021 0002
1 page
August 2019: Saturday, August 10 at 1:14 PM
No ratings yet
August 2019: Saturday, August 10 at 1:14 PM
1 page
Kawasaki Superbikes: Z1000 D & S
From Everand
Kawasaki Superbikes: Z1000 D & S
Stefan R. Oehl
No ratings yet
The Slot Car Handbook: The definitive guide to setting-up and running Scalextric sytle 1/32 scale ready-to-race slot cars
From Everand
The Slot Car Handbook: The definitive guide to setting-up and running Scalextric sytle 1/32 scale ready-to-race slot cars
Dave Chang
3/5 (1)

Assignment CSE-520

Uploaded by

Assignment CSE-520

Uploaded by

Statistics for Data Science

Course Code: CSE-520

Submitted to: Dr. Md. Sohel Rana

Submitted by: Tarik Adnan

# Read the CSV file

Car Model mpg cyl disp hp ... qsec vs am gear carb

[32 rows x 12 columns]

a. Graphical representation for the variable: “vs (Engine (0 = V-shaped, 1 = straight))”

fig, ax = plt.subplots(figsize=(10, 6), subplot_kw=dict(aspect="equal"))

ax.set_title("A Pie Representation of Engine Type (V -Shaped/Straight)

plt.xlabel('Feature Identifier: ‘vs’')

Comment on the plot:

fig, ax = plt.subplots(figsize=(7, 4), subplot_kw=dict(aspect="equal"))

# Read CSV data

ax.set_title("A Pie Representation of Transmission Type (Automatic/Manual)

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

Type the feature name to plot stem-leaf: mpg

The following stem-and-leaf plot is right skewed.

2. Stem –leaf plot for feature ‘cyl (Number of cylinders)’

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

Type the feature name to plot stem-leaf: cyl

The following stem-and-leaf plot is not skewed.

3. Stem – leaf plot for feature ‘disp (Displacement (cu.in.))’

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

Type the feature name to plot stem-leaf: disp

The following stem-and-leaf plot is right skewed.

4. Stem – leaf plot for feature ‘hp (Gross horsepower)’

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

Type the feature name to plot stem-leaf: hp

The following stem-and-leaf plot is right skewed.

5. Stem – leaf plot for feature ‘drat (Rear axle ratio)’

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

Type the feature name to plot stem-leaf: drat

The following stem-and-leaf plot is right skewed.

6. Stem – leaf plot for feature ‘wt (Weight (1000 lbs))’

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

Type the feature name to plot stem-leaf: wt

Comment on the plot:

The following stem-and-leaf plot is right skewed.

7. Stem – leaf plot for feature ‘qsec (1/4 mile time)’

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

Type the feature name to plot stem-leaf: qsec

Comment on the plot:

The following stem-and-leaf plot is symmetrical.

8. Stem – leaf plot for feature ‘gear (Number of forward gears) ’

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

Type the feature name to plot stem-leaf: gear

Comment on the plot:

The following stem-and-leaf plot is symmetric.

9. Stem – leaf plot for feature ‘carb (Number of carburetors)’

plt.title('Stem-Leaf plot of feature:' + " " + col_name)

Type the feature name to plot stem-leaf: carb

Comment on the plot:

The following stem-and-leaf plot is right skewed.

for file in files:

Comment on the plot:

for file in files:

Comment on the plot:

You might also like