PML Ex3

The document discusses implementing Matplotlib in Python. It describes Matplotlib as a comprehensive library for creating static, animated, and interactive visualizations. Key Matplotlib functions covered include plot(), scatter(), bar(), hist(), and implementations of linear and polynomial regression. Examples shown include plotting age vs weight data, sales of cars by manufacturer over time, reading real-world CSV data and creating box plots, histograms, scatter plots and bubble charts to analyze features of the data. Polynomial regression is demonstrated by fitting a polynomial model to salary data vs position level.

Uploaded by

Jasmitha B

Ex No: 3 MATPLOTLIB IN PYTHON

DATE:

Aim:

To implement Matplotlib using Python programming.

Description:
MATPLOTLIB:

Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in
Python. It makes easy things easy and hard things possible: it can produce publication-quality plots
and interactive figures that zoom, pan, and update.

Pyplot:

Most of the Matplotlib utilities lie under the pyplot submodule, which is usually imported under
the plt alias:

import matplotlib.pyplot as plt

plot():

The plot() function is used to draw points (markers) in a diagram. By default, it draws a line from point
to point. The function takes parameters that specify the points in the diagram: parameter 1 is an array
containing the points on the x-axis, and parameter 2 is an array containing the points on the y-axis.
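The two behaviours can be seen side by side in a minimal sketch. The point values here are made up for illustration and are not part of the exercise: without a format string plot() joins the points with a line, and with 'o' it draws markers only.

```python
import matplotlib.pyplot as plt
import numpy as np

# Illustrative points only (not taken from the exercise data)
xpoints = np.array([1, 2, 6, 8])
ypoints = np.array([3, 8, 1, 10])

plt.figure(figsize=(8, 3))

plt.subplot(1, 2, 1)
line, = plt.plot(xpoints, ypoints)        # default: a line from point to point
plt.title('default')

plt.subplot(1, 2, 2)
dots, = plt.plot(xpoints, ypoints, 'o')   # 'o': markers only, no connecting line
plt.title("'o' markers")

# Call plt.show() here to display the figure when running as a script.
```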

scatter():

The scatter() function plots one dot for each observation. It needs two arrays of the same length, one for
the values on the x-axis and one for the values on the y-axis.
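The same-length requirement can be checked directly. The following is a small sketch with illustrative values (not from the exercise): the first call draws four dots, and the second call, given arrays of different lengths, is expected to raise a ValueError.

```python
import matplotlib.pyplot as plt

# Illustrative observations only
x = [5, 7, 8, 7]
y = [99, 86, 87, 88]

plt.figure()
points = plt.scatter(x, y)        # four observations -> four dots

try:
    plt.scatter(x, y[:3])         # lengths 4 and 3 do not match
    mismatch_error = None
except ValueError as exc:
    mismatch_error = exc

print(mismatch_error)
```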

bar():

The bar() function takes arguments that describe the layout of the bars.

The categories and their values are given by the first and second arguments, as arrays.

plt.bar(x, y)
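As a short sketch of the call above, with made-up categories and values: the optional width and color arguments shown here are two of the layout settings bar() accepts.

```python
import matplotlib.pyplot as plt

# Illustrative categories and values only
x = ['A', 'B', 'C', 'D']
y = [3, 8, 1, 10]

plt.figure()
bars = plt.bar(x, y, width=0.6, color='steelblue')   # one bar per category
heights = [bar.get_height() for bar in bars]
print(heights)

# Call plt.show() here to display the figure when running as a script.
```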

hist():

A histogram is a graph showing frequency distributions.

It is a graph showing the number of observations within each given interval.

The hist() function uses an array of numbers to create a histogram; the array is passed to the function
as an argument.
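To make "observations within each given interval" concrete, here is a minimal sketch with twelve made-up scores. The bins and range values are chosen only so the intervals are easy to read; hist() returns the count per interval along with the interval edges.

```python
import numpy as np
import matplotlib.pyplot as plt

# Illustrative data: twelve made-up scores
scores = np.array([35, 42, 47, 51, 55, 58, 61, 64, 68, 72, 77, 93])

plt.figure()
# Four equal-width intervals between 30 and 110: 30-50, 50-70, 70-90, 90-110
counts, edges, patches = plt.hist(scores, bins=4, range=(30, 110))

print(counts)   # number of observations in each interval
print(edges)    # interval boundaries
```

For these values the counts are 3, 6, 2 and 1, which sum to the twelve observations.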

Linear Regression:

Linear regression uses the relationship between the data points to draw the straight line that best fits them.

This line can be used to predict future values.
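A minimal sketch of fitting and using such a line, assuming NumPy's polyfit with degree 1 as the fitting routine and five illustrative points (neither is part of the exercise):

```python
import numpy as np

# Illustrative points that lie close to a straight line
x = np.array([1, 2, 3, 4, 5])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

# Least-squares straight line: y is approximately slope * x + intercept
slope, intercept = np.polyfit(x, y, 1)

# Use the fitted line to predict the value at an unseen x
predicted = slope * 6 + intercept
print(slope, intercept, predicted)
```

Here the slope comes out at about 1.99 and the intercept at about 0.05, so the prediction at x = 6 is about 11.99.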

Polynomial Regression:

If your data points clearly will not fit a linear regression (a straight line through the data points),
polynomial regression might be a better choice.

Polynomial regression, like linear regression, uses the relationship between the variables x and y to find
the best way to draw a line through the data points; here the line is a polynomial curve rather than a straight line.
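The difference can be measured on a small example. This sketch assumes NumPy's polyfit as the fitting routine and uses illustrative data that follows y = x² exactly, then compares the sum of squared errors of a straight line and of a degree-2 polynomial.

```python
import numpy as np

# Illustrative data that follows y = x**2 exactly
x = np.arange(6)
y = x ** 2

line_coeffs = np.polyfit(x, y, 1)   # straight line: slope, intercept
quad_coeffs = np.polyfit(x, y, 2)   # degree-2 polynomial: a, b, c

# Sum of squared errors for each fit
sse_line = np.sum((np.polyval(line_coeffs, x) - y) ** 2)
sse_quad = np.sum((np.polyval(quad_coeffs, x) - y) ** 2)
print(sse_line, sse_quad)
```

The straight line leaves a clearly non-zero error, while the degree-2 fit reproduces the points up to rounding.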

IMPLEMENTATION:

1. Plot Age against Weight using matplotlib. Consider Age and Weight to be 1D
arrays of 10 members each. Plot them on the X and Y axes using the plot() function.

import matplotlib.pyplot as plt

import numpy as np
age=np.array([23,24,25,26,27,28,29,30,31,32])
weight=np.array([55,50,70,80,57,78,79,75,74,90])
plt.plot(age,weight,'o')
plt.xlabel('age')
plt.ylabel('weight')
plt.title('AGE WITH WEIGHT')
plt.show()

2. Plot a graph of car sales by Maruti for each year from 2015 to 2022. Fix the size
of the graph and use a specific line color for visualization.

import matplotlib.pyplot as plt

years = [2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022]
sales = [700000, 600000, 400000, 500000, 900000, 800000, 1000000, 1200000]
# Set the figure size before plotting; calling plt.figure() afterwards opens a new, empty figure
plt.figure(figsize=(8, 6))
plt.plot(years, sales, color='red')
plt.xlabel('Year')
plt.ylabel('Sales')
plt.title('Sales of cars by Maruti')
plt.show()

3. Plot the sales of cars by Audi over the same time period on the previous graph, using a
different line color and style, with a legend entry for each line [Hint: use legend()].
Add a title to the graph.

import numpy as np
from google.colab import files
sp=files.upload()

Saving student-mat.csv to student-mat.csv
import pandas as pd
import matplotlib.pyplot as plt
data = pd.read_csv("student-mat.csv")
plt.scatter(data['age'], data['traveltime'])
plt.title("Scatter Plot")
plt.xlabel('age')
plt.ylabel('traveltime')
plt.show()
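The task statement asks for two manufacturers on one graph with a legend. A minimal sketch of that layout follows; the Maruti figures are the ones from exercise 2, while the Audi figures are hypothetical values invented purely for illustration.

```python
import matplotlib.pyplot as plt

years = [2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022]
maruti_sales = [700000, 600000, 400000, 500000, 900000, 800000, 1000000, 1200000]
# Hypothetical Audi figures, invented for illustration only
audi_sales = [100000, 120000, 150000, 130000, 180000, 160000, 200000, 220000]

plt.figure(figsize=(8, 6))
# A different color and line style per manufacturer; label feeds the legend
plt.plot(years, maruti_sales, color='red', linestyle='-', label='Maruti')
plt.plot(years, audi_sales, color='blue', linestyle='--', marker='o', label='Audi')
plt.xlabel('Year')
plt.ylabel('Sales')
plt.title('Car sales by manufacturer, 2015-2022')
legend = plt.legend()

# Call plt.show() here to display the figure when running as a script.
```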

4. Read a real-world dataset in CSV form [Iris, Toy, Car etc.] and analyze its features.
(i) Finding the median and outliers using a box plot – single feature [continuous].

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
arr = np.random.randint(1, 20, size=30)
arr1 = np.append(arr, [27, 30])
print('Thus the array becomes{}'.format(arr1))
q1 = np.quantile(arr1, 0.25)
q3 = np.quantile(arr1, 0.75)
med = np.median(arr1)
iqr = q3-q1
upper_bound = q3+(1.5*iqr)
lower_bound = q1-(1.5*iqr)
print(iqr, upper_bound, lower_bound)

Thus the array becomes[19 5 7 12 9 5 17 11 19 10 7 14 10 13 16 3 16 18 7 19 18 9 11 4
 15 19 18 17 17 4 27 30]
9.5 32.25 -5.75

# Create the sized figure first so the box plot is drawn on it
fig = plt.figure(figsize=(10, 7))
plt.boxplot(arr1)
plt.show()
q1 = np.quantile(arr1, 0.25)
q3 = np.quantile(arr1, 0.75)
med = np.median(arr1)
iqr = q3-q1
upper_bound = q3+(1.5*iqr)
lower_bound = q1-(1.5*iqr)
print(iqr, upper_bound, lower_bound)

9.5 32.25 -5.75

outliers = arr1[(arr1 <= lower_bound) | (arr1 >= upper_bound)]
print('The following are the outliers in the boxplot:{}'.format(outliers))

The following are the outliers in the boxplot:[]

arr2 = arr1[(arr1 >= lower_bound) & (arr1 <= upper_bound)]
plt.figure(figsize=(12, 7))
plt.boxplot(arr2)
plt.show()
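Because np.random.randint produces different values on every run, it can help to recheck the fences on a fixed array. The sketch below assumes the 32 values printed in the output above are the data of interest and recomputes the quartiles, the interquartile range and the 1.5 × IQR fences for them.

```python
import numpy as np

# The 32 values shown in the printed array above
arr1 = np.array([19, 5, 7, 12, 9, 5, 17, 11, 19, 10, 7, 14, 10, 13, 16, 3,
                 16, 18, 7, 19, 18, 9, 11, 4, 15, 19, 18, 17, 17, 4, 27, 30])

q1, q3 = np.quantile(arr1, [0.25, 0.75])
iqr = q3 - q1
lower_bound = q1 - 1.5 * iqr
upper_bound = q3 + 1.5 * iqr

# Values outside the fences are treated as outliers
outliers = arr1[(arr1 < lower_bound) | (arr1 > upper_bound)]
print(q1, q3, iqr, lower_bound, upper_bound)
print(outliers)
```

For these values the fences are -5.75 and 32.25, so the appended values 27 and 30 fall inside them and no point is flagged as an outlier.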
import numpy as np
from google.colab import files
sp=files.upload()

Saving tips.csv to tips.csv

import seaborn as sns
import matplotlib.pyplot as plt
import pandas as pd
print(sns.get_dataset_names())

['anagrams', 'anscombe', 'attention', 'brain_networks', 'car_crashes', 'diamonds', 'dots',
'dowjones', 'exercise', 'flights', 'fmri', 'geyser', 'glue', 'healthexp', 'iris', 'mpg', 'penguins',
'planets', 'seaice', 'taxis', 'tips', 'titanic']

tips_df=sns.load_dataset('tips')
print(tips_df)
sns.lineplot(x="sex", y="total_bill", data=tips_df)
plt.title('Title using Matplotlib Function')

plt.show()

     total_bill   tip     sex smoker   day    time  size
0         16.99  1.01  Female     No   Sun  Dinner     2
1         10.34  1.66    Male     No   Sun  Dinner     3
2         21.01  3.50    Male     No   Sun  Dinner     3
3         23.68  3.31    Male     No   Sun  Dinner     2
4         24.59  3.61  Female     No   Sun  Dinner     4
..          ...   ...     ...    ...   ...     ...   ...
239       29.03  5.92    Male     No   Sat  Dinner     3
240       27.18  2.00  Female    Yes   Sat  Dinner     2
241       22.67  2.00    Male    Yes   Sat  Dinner     2
242       17.82  1.75    Male     No   Sat  Dinner     2
243       18.78  3.00  Female     No  Thur  Dinner     2

[244 rows x 7 columns]

BOX PLOT:

sns.boxplot(x='day',y='total_bill',data=tips_df,hue='sex',palette='afmhot')
plt.legend(loc=0)

(ii) Finding distribution using bar plot and histogram – Two features [categorical or
grouped].
BARPLOT:

sns.barplot(x='day',y='tip', data=tips_df,
hue='sex')
plt.show()

HISTOGRAM:

sns.histplot(x='total_bill', data=tips_df,kde=True, hue='sex')

plt.show()

(iii) Finding distribution across feature using scatter plot and Bubble chart – 3 or
more features [continuous/ categorical]
SCATTERPLOT:

sns.scatterplot(x='day', y='tip', data=tips_df)
plt.show()

sns.scatterplot(x='day', y='tip', data=tips_df,
hue='sex')
plt.show()

BUBBLE CHART:

import plotly.graph_objects as go

fig = go.Figure(data=[go.Scatter(
    x=[1, 2, 3, 4], y=[10, 11, 12, 13],
    mode='markers',
    marker=dict(
        color=['rgb(93, 164, 214)', 'rgb(255, 144, 14)',
               'rgb(44, 160, 101)', 'rgb(255, 65, 54)'],
        opacity=[1, 0.8, 0.6, 0.4],
        size=[40, 60, 80, 100],
    )
)])

fig.show()
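A bubble chart can also be drawn from the dataset itself with Matplotlib's scatter(), which maps a third feature to marker size. The sketch below is one possible version: it uses only the ten tips rows visible in the printout above (not the full 244-row dataset), colors the bubbles by sex, and sizes them by party size. The scale factor 60 and the two color names are arbitrary choices.

```python
import pandas as pd
import matplotlib.pyplot as plt

# Ten rows copied from the tips_df printout (first five and last five)
rows = [
    (16.99, 1.01, 'Female', 2), (10.34, 1.66, 'Male', 3),
    (21.01, 3.50, 'Male', 3),   (23.68, 3.31, 'Male', 2),
    (24.59, 3.61, 'Female', 4), (29.03, 5.92, 'Male', 3),
    (27.18, 2.00, 'Female', 2), (22.67, 2.00, 'Male', 2),
    (17.82, 1.75, 'Male', 2),   (18.78, 3.00, 'Female', 2),
]
sample = pd.DataFrame(rows, columns=['total_bill', 'tip', 'sex', 'size'])

# Color by sex; bubble area grows with party size (60 is an arbitrary scale)
colors = sample['sex'].map({'Female': 'tab:orange', 'Male': 'tab:blue'}).tolist()

plt.figure(figsize=(8, 6))
bubbles = plt.scatter(sample['total_bill'], sample['tip'],
                      s=sample['size'] * 60, c=colors, alpha=0.6)
plt.xlabel('total_bill')
plt.ylabel('tip')
plt.title('Bubble chart: tip vs total_bill, sized by party size')

# Call plt.show() here to display the figure when running as a script.
```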

5. Plot any two features from the dataset in scatter plot and find linear regression
between the features and plot the linear fit model

import numpy as np
from google.colab import files
sp=files.upload()

Saving student_scores.csv to student_scores.csv

import numpy as np

import pandas as pd

from matplotlib import pyplot as plt

import seaborn as sns

from sklearn.linear_model import LinearRegression
score_df = pd.read_csv('student_scores.csv')

score_df.head()

score_df.describe()

X = score_df.iloc[:, :-1].values

y = score_df.iloc[:, 1].values
print(y)
[30 90 80 45 67]
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

from sklearn.linear_model import LinearRegression

regressor = LinearRegression()
regressor.fit(X_train, y_train)

y_pred = regressor.predict(X_test)
plt.scatter(X_train, y_train, color='g')
# Draw the fitted line across the training inputs
plt.plot(X_train, regressor.predict(X_train), color='k')
plt.show()

6. Plot any two features from the dataset in scatter plot and find polynomial
regression between the features and plot the polynomial model

import numpy as np
from google.colab import files
sp=files.upload()

Saving salary_data.csv to salary_data.csv

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
dataset = pd.read_csv('https://s3.us-west-2.amazonaws.com/public.gamelab.fun/dataset/position_salaries.csv')
X = dataset.iloc[:, 1:2].values
y = dataset.iloc[:, 2].values

from sklearn.linear_model import LinearRegression
lin_reg = LinearRegression()
lin_reg.fit(X, y)
def viz_linear():
    plt.scatter(X, y, color='red')
    plt.plot(X, lin_reg.predict(X), color='blue')
    plt.title('Truth or Bluff (Linear Regression)')
    plt.xlabel('Position level')
    plt.ylabel('Salary')
    plt.show()
    return
viz_linear()

from sklearn.preprocessing import PolynomialFeatures
poly_reg = PolynomialFeatures(degree=4)
X_poly = poly_reg.fit_transform(X)
pol_reg = LinearRegression()
pol_reg.fit(X_poly, y)
def viz_polynomial():
    plt.scatter(X, y, color='red')
    # Reuse the already-fitted transformer; transform() is sufficient here
    plt.plot(X, pol_reg.predict(poly_reg.transform(X)), color='blue')
    plt.title('Truth or Bluff (Polynomial Regression)')
    plt.xlabel('Position level')
    plt.ylabel('Salary')
    plt.show()
    return
viz_polynomial()
lin_reg.predict([[5.5]])
pol_reg.predict(poly_reg.transform([[5.5]]))

array([132148.43750002])

Problem Understanding | Implementation | Time Management | Viva | Total

RESULT:

Thus, Matplotlib was implemented using Python programming, and the programs were executed successfully.
