Data Science Lab Manual Full

Uploaded by

vinodha


EX.NO:1 DOWNLOAD, INSTALL AND EXPLORE THE FEATURES OF
NUMPY, SCIPY, JUPYTER, STATSMODELS AND PANDAS PACKAGES

Aim:
To download and install Python and the NUMPY, SCIPY, JUPYTER,
STATSMODELS and PANDAS packages.

Step 1 − Select Version of Python to Install

Python is available in several versions, which differ in syntax and behaviour. Choose the version
you want or need to use. Releases exist in both the Python 2 and Python 3 series.

Step 2 − Download Python Executable Installer

In a web browser, open the official Python site (www.python.org) and go to the Downloads
section for Windows.

All the available versions of Python are listed. Select the version you require and click
Download. Suppose we choose Python 3.9.1, the version used in Step 3.
After clicking Download, the available executable installers are shown for different operating
system specifications. Choose the installer that suits your operating system and download it.
Suppose we select the Windows installer (64-bit).

The download size is less than 30 MB.

Step 3 − Run Executable Installer

We downloaded the Python 3.9.1 Windows 64-bit installer. Run the installer. Make sure to select both
the checkboxes at the bottom and then click Install Now.
On clicking Install Now, the installation process starts.

The installation process takes a few minutes to complete. Once the installation is successful, a
screen confirming that setup was successful is displayed.

Step 4 − Verify Python is installed on Windows

To check whether Python is successfully installed on your system, follow the given steps −

• Open the command prompt.

• Type 'python' and press Enter.
• If Python is successfully installed on Windows, the version you have installed is
displayed.

Step 5 − Verify Pip was installed

Pip is the package management system for Python software packages, so make sure that
you have it installed.

To verify that pip was installed, follow the given steps −

• Open the command prompt.

• Enter pip -V to check whether pip was installed.
• If pip is installed successfully, its version and install location are displayed.
Step 6 − Install Packages using Pip

Pip allows you to install Python packages such as NUMPY, SCIPY, MATPLOTLIB,
PANDAS, JUPYTER and STATSMODELS using the command pip install packagename.
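After installing, it can help to confirm which versions pip actually put in place. The following is a minimal sketch, assuming Python 3.8 or later (for importlib.metadata). The helper name installed_version is made up for this illustration, and the package list simply mirrors the one named above.

```python
from importlib import metadata


def installed_version(package_name):
    """Return the installed version string of a package, or None if it is absent."""
    try:
        return metadata.version(package_name)
    except metadata.PackageNotFoundError:
        return None


# Package names as they would be passed to "pip install"
packages = ["numpy", "scipy", "matplotlib", "pandas", "jupyter", "statsmodels"]

for name in packages:
    version = installed_version(name)
    status = version if version is not None else "not installed"
    print(f"{name}: {status}")
```

Any package reported as "not installed" can then be added with pip install packagename.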

Result:
Thus Python, pip and various Python packages have been successfully installed on our Windows
system.
EX.NO:2 WORKING WITH NUMPY ARRAYS

AIM:

To write Python programs that work with NumPy arrays.

ALGORITHM:
1. Download and install the numpy package using the command pip install numpy.
2. Create one-dimensional and n-dimensional arrays in NumPy.
3. Apply linear algebra operations to n-dimensional arrays without using for-loops.
4. Examine the axis and shape properties of n-dimensional arrays. In NumPy, dimensions
are called axes.
5. For a matrix with n rows and m columns, the shape is (n, m).

6. The length of the shape tuple is the number of axes, ndim. The size property gives the total
number of elements of the array.

7. Create arrays or specify their datatype using standard Python types. Stop the program.
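Steps 3 to 7 of the algorithm can be sketched as follows. This is an illustrative example with made-up matrices, not part of the original programs; it shows a loop-free matrix product, sums along each axis, and the shape, ndim, size and dtype properties.

```python
import numpy as np

# dtype given with a standard Python type
a = np.array([[1, 2, 3], [4, 5, 6]], dtype=float)    # shape (2, 3)
b = np.array([[1, 0], [0, 1], [2, 2]], dtype=float)  # shape (3, 2)

product = a @ b            # matrix product without any for-loop
col_sums = a.sum(axis=0)   # collapse axis 0 (rows): one sum per column
row_sums = a.sum(axis=1)   # collapse axis 1 (columns): one sum per row

print("shape:", a.shape)   # (n rows, m columns)
print("ndim :", a.ndim)    # number of axes, equal to len(a.shape)
print("size :", a.size)    # total number of elements
print("dtype:", a.dtype)
print(product)
print(col_sums, row_sums)
```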
PROGRAM EX.NO:1

import numpy as np

arr = np.array([1, 2, 3, 4, 5])
print(arr)
print(type(arr))

Output:

[1 2 3 4 5]
<class 'numpy.ndarray'>

Ex.No:2
import numpy as np
arr = np.array([[1, 2, 3], [4, 5, 6]])
print(arr)
print(type(arr))

Output:
[[1 2 3]
 [4 5 6]]
<class 'numpy.ndarray'>

Multi-Dimensional Array:

Ex.No:3

import numpy as np

arr = np.array([[[1, 2, 3], [4, 5, 6]], [[1, 2, 3], [4, 5, 6]]])

print(arr)

Output:

[[[1 2 3]
  [4 5 6]]

 [[1 2 3]
  [4 5 6]]]
Ex.No:4

import numpy as np

list_1 = [1, 2, 3, 4]
list_2 = [5, 6, 7, 8]
list_3 = [9, 10, 11, 12]

sample_array = np.array([list_1, list_2, list_3])

print("Numpy array :")
print(sample_array)
print("Shape of the array :", sample_array.shape)

Output:
Numpy array :
[[ 1  2  3  4]
 [ 5  6  7  8]
 [ 9 10 11 12]]
Shape of the array : (3, 4)

Ex.No:5

import numpy as np

arr = np.array([1, 2, 3, 4])
print(arr[2] + arr[3])

Output:

7

Ex.No:6

import numpy as np

arr = np.array([[1,2,3,4,5], [6,7,8,9,10]])

print('2nd element on 1st row: ', arr[0, 1])

Output:

2nd element on 1st row:  2

Ex.No:7

import numpy as np

arr = np.array([[[1, 2, 3], [4, 5, 6]], [[7, 8, 9], [10, 11, 12]]])

print(arr[0, 1, 2])

Output:

6

Result:

Thus the various array operations on NumPy arrays have been carried out and verified successfully.
EX.NO:3 PANDAS DATA FRAMES

AIM:

To write Python programs that work with pandas Series and DataFrames.

ALGORITHM:

1. Start the Pandas data frames program.

2. Use the function pandas provides to read data stored as a .csv file into a pandas DataFrame.

3. Pandas supports many different file formats and data sources. Select a subset of the
data frame.

4. Create new rows and columns derived from existing data. Combine multiple tables.

5. Handle time series data. Stop the program.
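The algorithm steps above can be sketched in one short script. This is a hedged illustration: the CSV text, the dates and the extra row are made up for the example (the calories and duration values echo the programs below), and an in-memory string stands in for a real .csv file.

```python
import io
import pandas as pd

# Made-up CSV content standing in for a file on disk
csv_text = (
    "day,calories,duration\n"
    "2024-01-01,420,50\n"
    "2024-01-02,380,40\n"
    "2024-01-03,390,45\n"
)

# Step 2: read CSV data into a DataFrame, parsing the date column
df = pd.read_csv(io.StringIO(csv_text), parse_dates=["day"])

# Step 3: select a subset of rows
long_sessions = df[df["duration"] > 40]

# Step 4: combine two tables, then derive a new column from existing ones
extra = pd.DataFrame(
    {"day": pd.to_datetime(["2024-01-04"]), "calories": [400], "duration": [50]}
)
combined = pd.concat([df, extra], ignore_index=True)
combined["calories_per_min"] = combined["calories"] / combined["duration"]

# Step 5: treat the data as a time series indexed by date
daily = combined.set_index("day")["calories"]
change = daily.diff()  # day-to-day change in calories

print(long_sessions)
print(combined)
print(change)
```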

EX.NO:1

PROGRAM

import pandas as pd

S = pd.Series([11, 28, 72, 3, 5, 8])

print(S)

OUTPUT:

0 11
1 28
2 72
3 3
4 5
5 8
dtype: int64

EX.NO: 2

import pandas as pd

a = [1, 7, 2]

myvar = pd.Series(a)

print(myvar)

OUTPUT:
0 1
1 7
2 2
dtype: int64

EX.NO:3

import pandas as pd

data = {
"calories": [420, 380, 390],
"duration": [50, 40, 45]
}
df = pd.DataFrame(data)
print(df)

OUTPUT

   calories  duration
0       420        50
1       380        40
2       390        45

EX.NO:4
import pandas as pd

data = {
"calories": [420, 380, 390],
"duration": [50, 40, 45]
}
df = pd.DataFrame(data, index = ["day1", "day2", "day3"])
print(df)

OUTPUT

      calories  duration
day1       420        50
day2       380        40
day3       390        45

EX.NO:5

import pandas as pd

a = [1, 7, 2]

myvar = pd.Series(a, index = ["x", "y", "z"])

print(myvar)

OUTPUT
x 1
y 7
z 2
dtype: int64
EX.NO:6

import pandas as pd

calories = {"day1": 420, "day2": 380, "day3": 390}

myvar = pd.Series(calories)

print(myvar)

OUTPUT:
day1 420
day2 380
day3 390
dtype: int64

EX.NO:7

import pandas as pd

calories = {"day1": 420, "day2": 380, "day3": 390}

myvar = pd.Series(calories, index = ["day1", "day2"])

print(myvar)

Sample Output:

day1 420
day2 380
dtype: int64

Result:

Thus the various data manipulation operations on pandas DataFrames have been
carried out and verified successfully.
EX.NO:4 READING DATA FROM TEXT FILES, EXCEL AND THE WEB
AND EXPLORING VARIOUS COMMANDS FOR DOING
DESCRIPTIVE ANALYTICS ON THE IRIS DATA SET.

AIM:

To read data from text files, Excel and the web, and to explore various commands for doing
descriptive analytics on the iris data set.

Steps:

Kaggle DataSet:

https://www.kaggle.com/datasets/uciml/iris

Step 1:

Download the IRIS data set from the Kaggle website and save it in the Documents folder.

Step 2:

Open the Jupyter notebook and type the following commands

import pandas as pd

iris=pd.read_csv("Documents/iris.data.csv")

iris
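The aim also mentions text files, Excel and the web, while the step above reads a single CSV file. A minimal sketch of one loader covering all three is given below. The function name load_table, the sample rows and the temporary file are assumptions made for this illustration; reading .xlsx files additionally requires an Excel engine such as openpyxl, and a web source is read by passing its http(s) URL as the source.

```python
import os
import tempfile
import pandas as pd


def load_table(source, sep=","):
    """Load a table from a local path or a URL, picking the reader by file extension."""
    if source.lower().endswith((".xlsx", ".xls")):
        # Excel workbooks need an engine such as openpyxl to be installed
        return pd.read_excel(source)
    # Delimited text; pandas also accepts http(s) URLs here
    return pd.read_csv(source, sep=sep)


# Demonstration with a small tab-separated text file (illustrative rows only)
rows = (
    "sepal_length\tsepal_width\tspecies\n"
    "5.1\t3.5\tsetosa\n"
    "7.0\t3.2\tversicolor\n"
)
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "sample.txt")
    with open(path, "w") as handle:
        handle.write(rows)
    sample = load_table(path, sep="\t")

print(sample)
print(sample.describe())  # descriptive statistics of the numeric columns
```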
Step 3:

Edit the Program in Python IDLE

import numpy as np
import pandas as pd
import seaborn as sns
sns.set_palette('husl')
import matplotlib.pyplot as plt
from subprocess import check_output
data = pd.read_csv('C:/Users/Welcome/Downloads/archive/Iris.csv')
data.head()
data.info()
data.describe()
data['Species'].value_counts()
tmp = data.drop('Id', axis=1)
g = sns.pairplot(tmp, hue='Species', markers='+')
plt.show()
g = sns.violinplot(y='Species', x='SepalLengthCm', data=data, inner='quartile')
plt.show()
g = sns.violinplot(y='Species', x='SepalWidthCm', data=data, inner='quartile')
plt.show()
g = sns.violinplot(y='Species', x='PetalLengthCm', data=data, inner='quartile')
plt.show()
g = sns.violinplot(y='Species', x='PetalWidthCm', data=data, inner='quartile')
plt.show()
OUTPUT:

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 150 entries, 0 to 149
Data columns (total 6 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Id 150 non-null int64
1 SepalLengthCm 150 non-null float64
2 SepalWidthCm 150 non-null float64
3 PetalLengthCm 150 non-null float64
4 PetalWidthCm 150 non-null float64
5 Species 150 non-null object
dtypes: float64(4), int64(1), object(1)
memory usage: 7.2+ KB

In[44]: iris.tail()
Out[44]:

In[45]: iris.species.unique()
Out[45]: array(['setosa', 'versicolor', 'virginica'], dtype=object)

In[46]: iris1 = iris.groupby('species', as_index=False)["sepal_length"].count()
iris1

In[59]: # draw the three species on one set of axes; species names follow Out[45]
ax = iris[iris.species == 'setosa'].plot.scatter(x='sepal_length', y='sepal_width', color='red', label='setosa')
iris[iris.species == 'versicolor'].plot.scatter(x='sepal_length', y='sepal_width', color='green', label='versicolor', ax=ax)
iris[iris.species == 'virginica'].plot.scatter(x='sepal_length', y='sepal_width', color='blue', label='virginica', ax=ax)
ax.set_xlabel("Sepal Length")
ax.set_ylabel("Sepal Width")
ax.set_title("Relationship between Sepal Length and Width")
Out[59]: Text(0.5, 1.0, 'Relationship between Sepal Length and Width')

In[49]: g = sns.FacetGrid(iris, hue="species", height=6)
g.map(plt.scatter, "petal_length", "petal_width")
g.add_legend()
plt.title("Relationship between Petal Length and Width")

Result:
Thus data was read from text files, Excel and the web, and various commands for doing
descriptive analytics were explored on the iris data set.
EX.NO:5 USE THE DIABETES DATASET FROM UCI AND PIMA
INDIANS DIABETES DATASET FOR PERFORMING
THE FOLLOWING:

Aim:

a. Implement Univariate analysis: Frequency, Mean, Median, Mode, Variance,
Standard Deviation, Skewness and Kurtosis from the UCI Dataset

b. Bivariate analysis: Linear and logistic regression modeling

c. Multiple Regression analysis

d. Also compare the results of the above analysis for the two datasets.
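The univariate measures listed in part (a) can each be obtained with one pandas call. The sketch below uses a small made-up series rather than a dataset column, so the numbers are for illustration only; on the real data, the same calls would be applied to a column such as pima["Glucose"].

```python
import pandas as pd

values = pd.Series([1, 2, 2, 3, 7], name="demo")   # made-up numbers, not dataset values

summary = {
    "frequency": values.value_counts().to_dict(),  # how often each value occurs
    "mean": values.mean(),
    "median": values.median(),
    "mode": values.mode().tolist(),                # may contain several values
    "variance": values.var(),                      # sample variance (ddof=1)
    "std_dev": values.std(),                       # sample standard deviation
    "skewness": values.skew(),
    "kurtosis": values.kurt(),                     # excess kurtosis
}

for name, stat in summary.items():
    print(f"{name}: {stat}")
```

Note that pandas reports the sample variance and standard deviation (dividing by n - 1) by default, which matters when comparing results with other tools.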
STEPS:

STEP 1: Download the Pima Indians Diabetes Dataset.

STEP 2: Open the Jupyter notebook and type the following commands.

PROGRAM:

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
sns.set(color_codes=True)
pima = pd.read_csv('C:/Users/Welcome/Downloads/diabetes.csv')

In[1]: pima.isnull().values.any()
Out[1]: False

In[2]: pima.describe()
Out[2]:

In[3]: pima
Out[3]:

In[4]: dial1 = pima[pima.Outcome == 1]
dial0 = pima[pima.Outcome == 0]

In[5]: dial1
Out[5]:

In[6]: dial0
Out[6]:

In[7]: sns.countplot(x=pima.Outcome)
plt.title("count plot for outcome")
Out[7]: Text(0.5, 1.0, 'count plot for outcome')

Out[8]: (65.104166666666667, 34.8958333333336)
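The input cell that produced Out[8] is not shown. One plausible way to obtain such a pair of percentages is sketched below. The counts of 500 rows with Outcome 0 and 268 rows with Outcome 1 are an assumption, chosen because they reproduce the two percentages above; on the real data the series would simply be pima.Outcome.

```python
import pandas as pd

# Assumed class counts: 500 rows with Outcome 0 and 268 rows with Outcome 1
outcome = pd.Series([0] * 500 + [1] * 268, name="Outcome")

# Share of each class as a percentage of all rows
percent = outcome.value_counts(normalize=True) * 100

print(percent.loc[0], percent.loc[1])
```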

Pregnancies variable visualization

In[9]:

plt.figure(figsize=(20, 6))

plt.subplot(1, 3, 1)
sns.set_style("dark")
plt.title("Histogram for Pregnancies")
# histplot is used because displot is figure-level and ignores plt.subplot
sns.histplot(pima.Pregnancies, kde=False)

plt.subplot(1, 3, 2)
sns.histplot(dial0.Pregnancies, kde=False, color="Blue", label="Preg for Outcome=0")
sns.histplot(dial1.Pregnancies, kde=True, color="Gold", label="Preg for Outcome=1")
plt.title("Histograms for Preg by Outcome")
plt.legend()

plt.subplot(1, 3, 3)
sns.boxplot(x=pima.Outcome, y=pima.Pregnancies)
plt.title("Boxplot for Preg by Outcome")

Out[9]: Text(0.5, 1.0, 'Boxplot for Preg by Outcome')

Screening Variable - Glucose

In[10]:

plt.figure(figsize=(20, 6))

plt.subplot(1, 3, 1)
plt.title("Histogram for Glucose")
# histplot replaces distplot, which newer seaborn releases deprecate
sns.histplot(pima.Glucose, kde=False)

plt.subplot(1, 3, 2)
sns.histplot(dial0.Glucose, kde=False, color="Blue", label="Gluc for Outcome=0")
sns.histplot(dial1.Glucose, kde=False, color="Gold", label="Gluc for Outcome=1")
plt.title("Histograms for Glucose by Outcome")
plt.legend()

plt.subplot(1, 3, 3)
sns.boxplot(x=pima.Outcome, y=pima.Glucose)
plt.title("Boxplot for Glucose by Outcome")

Out[10]: Text(0.5, 1.0, 'Boxplot for Glucose by Outcome')

In[11]:
import pandas as pd
pima = pd.read_csv('c:/users/welcome/downloads/diabetes.csv')
import matplotlib.pyplot as plt
import seaborn as sns
sns.set(color_codes=True)
pima
Screening of association variables to study bivariate relationships

In[12]: sns.pairplot(pima, vars=["Pregnancies", "Glucose", "BloodPressure", "SkinThickness"], hue="Outcome")
plt.title("pairplot of variables by Outcome")

Out[12]: Text(0.5, 1.0, 'pairplot of variables by Outcome')

Correlation Mapping

In[13]: cor = pima.corr(method='pearson')
cor

Out[13]:

Heatmap chart

In[14]: sns.heatmap(cor)
Out[14]: <AxesSubplot:>

Logistic Regression:

import statsmodels.api as sm

cols = ['Pregnancies', 'Glucose', 'BloodPressure', 'SkinThickness', 'Insulin', 'BMI']
x = pima[cols]
y = pima.Outcome

logit_model = sm.Logit(y, x)
result = logit_model.fit()
print(result.summary())
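The aim also lists linear and multiple regression, which the listing above does not cover. A minimal sketch with NumPy least squares is given below. It runs on synthetic data (not the Pima columns), and the variable names x1, x2 and y are placeholders invented for the example; with statsmodels, sm.OLS would play the role that sm.Logit plays above.

```python
import numpy as np

rng = np.random.default_rng(0)
x1 = rng.normal(size=50)
x2 = rng.normal(size=50)
y = 2.0 + 3.0 * x1 - 1.0 * x2      # synthetic response: exact linear relation, no noise

# Simple (bivariate) linear regression: y on x1 only, with an intercept column
X_simple = np.column_stack([np.ones_like(x1), x1])
coef_simple, *_ = np.linalg.lstsq(X_simple, y, rcond=None)

# Multiple regression: y on x1 and x2 together
X_multi = np.column_stack([np.ones_like(x1), x1, x2])
coef_multi, *_ = np.linalg.lstsq(X_multi, y, rcond=None)

print("simple  :", coef_simple)    # intercept and slope for x1
print("multiple:", coef_multi)     # intercept, then one coefficient per predictor
```

Because the synthetic response has no noise, the multiple regression recovers the generating coefficients, while the simple regression absorbs the effect of the omitted x2 into its estimates.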

Result:

Thus the program was executed successfully on the diabetes dataset in Python.

EX.NO:6 APPLY AND EXPLORE VARIOUS PLOTTING
FUNCTIONS ON UCI DATASET

Aim:

To implement the following using UCI data sets:
a. Normal Curves
b. Density and contour plots
c. Correlation and scatter plots
d. Histograms
e. Three dimensional plotting

1. Drawing Plots

• Matplotlib is a Python 2D plotting library and the most widely used library for data
visualization. It provides an extensive set of plotting APIs to create various plots such
as scatter, bar, box, and distribution plots with custom styling and annotation. Detailed
documentation for matplotlib can be found at https://matplotlib.org/

• Seaborn is also a Python data visualization library, based on matplotlib. It provides a
high-level interface for drawing informative statistical charts:
https://seaborn.pydata.org/

Matplotlib is written in Python and makes use of NumPy arrays. Seaborn, which is built on top
of matplotlib, is a library for making elegant charts in Python and is well integrated with pandas
DataFrames.
To create graphs and plots, we need to import the matplotlib.pyplot and seaborn modules.
To display the plots in the Jupyter Notebook, we provide the directive %matplotlib inline,
which renders the plots inside the notebook.
PROGRAM:

import matplotlib.pyplot as plt

import seaborn as sn

%matplotlib inline

2. Bar Chart

The bar chart is a frequency chart for a qualitative variable. A bar chart can be used to assess
the most frequently and least frequently occurring categories within a dataset. To draw a bar chart, call barplot()
of the seaborn library. The data frame should be passed in the data parameter.

A bar chart displays a set of categories in one axis and the percentage or frequencies of a variable
for those categories in another axis. The height of the bar is either less or more depending upon the
frequency value. In a Vertical Bar Chart, the X-axis will represent categories and Y-axis will represent
frequencies. In a Horizontal Bar Chart, it is the inverse. In a Vertical Bar Chart, the bars
grow downwards below the X-axis for negative values. In a Horizontal Bar Chart, the bars
grow leftwards from the Y-axis for negative values.
Program:
import seaborn as sns
import matplotlib.pyplot as plt

# load the titanic example dataset through seaborn
df = sns.load_dataset('titanic')

# who vs. fare bar plot
sns.barplot(x='who', y='fare', data=df)

# show the plot
plt.show()

3. Pandas Histogram

Let's understand how to create a histogram in pandas and how it is useful. Histograms are
very useful in statistical analysis. They are generally used to represent the frequency
distribution of a numeric array, split into small equal-sized bins. As we use pandas to
work with tabular data, it is important to know how to work with histograms in a pandas
DataFrame. pandas.DataFrame.hist and pandas.DataFrame.plot.hist are two popular
functions that plot histograms directly from pandas DataFrames. Matplotlib's own hist
function can also be used:
plt.hist(df['fare'])
Program:

import seaborn as sns
import matplotlib.pyplot as plt

# load the titanic example dataset through seaborn
df = sns.load_dataset('titanic')

# histogram of the fare column
plt.hist(df['fare'])
plt.show()
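The text names pandas.DataFrame.plot.hist, but the program uses matplotlib directly. A small sketch of the pandas route follows. The fare values are made up so that the example runs without downloading the Titanic data, and np.histogram is included only to show the bin counts behind the bars.

```python
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

# Made-up fares; the real Titanic column is not used here
df = pd.DataFrame(
    {"fare": [7.25, 8.05, 13.0, 26.0, 26.55, 30.0, 52.0, 71.3, 80.0, 151.55]}
)

# The numbers behind the bars: how many values fall into each of 5 equal-width bins
counts, edges = np.histogram(df["fare"], bins=5)
print(counts)
print(edges)

# Histogram drawn straight from the DataFrame
ax = df.plot.hist(bins=5, edgecolor="black")
ax.set_xlabel("fare")
plt.show()
```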

4. Distribution Plot

A distribution or density plot depicts the distribution of data over a continuous interval. A
density plot is like a smoothed histogram, so it also gives insight into what might be the
distribution of the population.

sns.histplot(df['fare'], kde=True)

Program:

import seaborn as sns
import matplotlib.pyplot as plt

# load the titanic example dataset through seaborn
df = sns.load_dataset('titanic')

# histogram of fare with a smoothed density (KDE) curve;
# histplot is used because distplot is deprecated in newer seaborn releases
sns.histplot(df['fare'], kde=True)
plt.show()
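The aim lists normal curves, which none of the programs draw. A minimal sketch is shown below; the mean of 0 and standard deviation of 1 are assumed values for the illustration, and the density is computed directly from the normal probability density formula with NumPy.

```python
import numpy as np
import matplotlib.pyplot as plt

mu, sigma = 0.0, 1.0                      # assumed mean and standard deviation
x = np.linspace(mu - 4 * sigma, mu + 4 * sigma, 401)

# Probability density of the normal distribution N(mu, sigma^2)
pdf = np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

plt.plot(x, pdf)
plt.title("Normal curve")
plt.xlabel("x")
plt.ylabel("density")
plt.show()
```

To compare a data column with a normal curve, mu and sigma could be set to that column's mean and standard deviation and the curve drawn over its density plot.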
5. Box Plot

A box plot is a visual representation of groups of numerical data through their
quartiles. It is also used to detect outliers in a data set. It captures the summary of
the data efficiently with a simple box and whiskers and allows us to compare easily across
groups. A box plot summarizes sample data using the 25th, 50th and 75th percentiles. These
percentiles are also known as the lower quartile, median and upper quartile.
A box plot consists of 5 things.
• Minimum
• First Quartile or 25%
• Median (Second Quartile) or 50%
• Third Quartile or 75%
• Maximum

To draw the box plot, call boxplot() of the seaborn library.
box = sns.boxplot(x=df['fare'])

Program:
import seaborn as sns
import matplotlib.pyplot as plt

# load the titanic example dataset through seaborn
df = sns.load_dataset('titanic')

# box plot of the fare column
box = sns.boxplot(x=df['fare'])
plt.show()
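The quantities a box plot displays can also be computed directly. The sketch below uses a made-up sample with one extreme value and applies the 1.5 × IQR rule to flag outliers; that rule is the usual default whisker convention in matplotlib, and it is stated here as an assumption about the plot settings.

```python
import numpy as np

data = np.array([1, 2, 3, 4, 5, 6, 7, 8, 100])   # made-up sample with one extreme value

# Lower quartile, median and upper quartile
q1, median, q3 = np.percentile(data, [25, 50, 75])

# Interquartile range and the fences beyond which points count as outliers
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = data[(data < lower_fence) | (data > upper_fence)]

print("min, Q1, median, Q3, max:", data.min(), q1, median, q3, data.max())
print("IQR:", iqr)
print("outliers:", outliers)
```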

6. Scatter Plot

A scatter plot is a means to represent data in a graphical format. A simple scatter plot makes
use of the Coordinate axes to plot the points, based on their values and reveal the correlation
present between the variables.
plt.scatter(x,y)

Program:

import numpy
import matplotlib.pyplot as plt
x=numpy.random.normal(5.0,1.0,1000)
y=numpy.random.normal(10.0,2.0,1000)
plt.scatter(x,y)
plt.show()
7. Pair Plot

Pair plots are an easier way to draw scatter plots when there are more than two variables. They can
be plotted using the pairplot() method.

Program:
import seaborn as sns
import matplotlib.pyplot as plt
df = sns.load_dataset('tips')
sns.pairplot(df, hue='sex')
plt.show()
8. Correlation and Heatmap

Correlation is used for measuring the strength and direction of the linear relationship
between two continuous random variables x and y. A positive correlation means the
variables increase or decrease together. A negative correlation means that if one variable
increases, the other decreases.

Correlation values can be computed using the corr() method of the DataFrame and
rendered using a heatmap.
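A tiny worked example makes the sign of the correlation concrete. The three columns below are made up for illustration: one rises with x and one falls as x rises.

```python
import pandas as pd

df = pd.DataFrame({
    "x": [1, 2, 3, 4, 5],
    "up": [2, 4, 6, 8, 10],     # rises with x: perfect positive correlation
    "down": [5, 4, 3, 2, 1],    # falls as x rises: perfect negative correlation
})

# Pearson correlation between every pair of columns
corr = df.corr(method="pearson")
print(corr)
```

The resulting matrix is symmetric with ones on the diagonal, which is why a heatmap of it is mirrored across the diagonal.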

Program:
# import modules
import matplotlib.pyplot as mp
import pandas as pd
import seaborn as sb

# import file with data
data = pd.read_csv("C:\\Users\\Vanshi\\Desktop\\bestsellers.csv")

# correlation is defined only for numeric columns, so restrict corr() to them
corr = data.corr(numeric_only=True)

# print the correlation matrix that will be plotted
print(corr)

# plot the correlation heatmap
dataplot = sb.heatmap(corr, cmap="YlGnBu", annot=True)

# display the heatmap
mp.show()
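Contour plots and three-dimensional plotting appear in the aim but not in the programs. A minimal sketch of both is given below. It uses a synthetic bell-shaped surface computed with NumPy rather than a UCI column, so every value in it is an assumption for illustration.

```python
import numpy as np
import matplotlib.pyplot as plt

# Synthetic grid and a bell-shaped surface over it
x = np.linspace(-3, 3, 61)
y = np.linspace(-3, 3, 61)
X, Y = np.meshgrid(x, y)
Z = np.exp(-(X ** 2 + Y ** 2) / 2)

fig = plt.figure(figsize=(10, 4))

# Filled contour plot of the surface
ax1 = fig.add_subplot(1, 2, 1)
contours = ax1.contourf(X, Y, Z, levels=10, cmap="viridis")
fig.colorbar(contours, ax=ax1)
ax1.set_title("Contour plot")

# The same surface drawn in three dimensions
ax2 = fig.add_subplot(1, 2, 2, projection="3d")
ax2.plot_surface(X, Y, Z, cmap="viridis")
ax2.set_title("3D surface")

plt.show()
```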
Result:

Thus the program to draw various plots was executed successfully on UCI datasets.
EX.NO:7 VISUALIZING GEOGRAPHICAL DATA WITH BASEMAP

Aim:

To visualize the various geographical data with the help of basemap.

Basemap

Basemap is a great tool for creating maps with Python in a simple way. It is
a matplotlib extension, so it has all of matplotlib's features for creating data visualizations, and adds
geographical projections and some datasets so that coast lines, countries,
and so on can be plotted directly from the library.

Basemap has some documentation, but some things are a bit more difficult to find. I
started this documentation to extend the original documentation and examples a little, but
it grew, and it now covers many of the Basemap possibilities.

To install the Basemap package in Python, use pip install basemap. The programs below import it
from mpl_toolkits.basemap.

Basemap methods

Draw counties

Draws the USA counties from the layer included with the library

drawcounties(linewidth=0.1, linestyle='solid', color='k', antialiased=1,
facecolor='none', ax=None, zorder=None, drawbounds=False)

• linewidth sets the line width in points

• linestyle sets the line type. By default it is solid, but it can be dashed, or any matplotlib
option

• color is k (black) by default. It follows matplotlib conventions

• antialiased is true by default

• zorder sets the layer position. By default, the order is set by Basemap
PROGRAM:

from mpl_toolkits.basemap import Basemap

import matplotlib.pyplot as plt

map = Basemap(llcrnrlon=-93.,llcrnrlat=40.,urcrnrlon=-75.,urcrnrlat=50.,

resolution='i', projection='tmerc', lat_0 = 40., lon_0 = -80)

map.drawmapboundary(fill_color='aqua')
map.fillcontinents(color='#cc9955', lake_color='aqua')

map.drawcounties()

plt.show()

Draw countries

• Draws the country borders from the layer included with the library.

• The function has the following arguments:

drawcountries(linewidth=1.0, linestyle='solid', color='k', antialiased=1, ax=None,
zorder=None)

• linewidth sets the line width in points

• linestyle sets the line type. By default it is solid, but it can be dashed, or any matplotlib
option
• color is k (black) by default. It follows matplotlib conventions

• antialiased is true by default

• zorder sets the layer position. By default, the order is set by Basemap

Note that:

The resolution indicated when creating the Basemap instance gives the layer a
finer or coarser resolution

The coastline is drawn by another function, and coasts are not drawn as country borders,
so this method must be combined with others (such as drawcoastlines) to get a good map

PROGRAM

from mpl_toolkits.basemap import Basemap
import matplotlib.pyplot as plt

map = Basemap(projection='ortho', lat_0=0, lon_0=0)

map.drawmapboundary(fill_color='aqua')

map.fillcontinents(color='coral',lake_color='aqua')

map.drawcountries()

plt.show()
Drawmap boundary

Draws the earth boundary on the map, with optional filling.

drawmapboundary(color='k', linewidth=1.0, fill_color=None, zorder=None,
ax=None)

• linewidth sets the line width in points

• color sets the edge color and is k (black) by default. It follows matplotlib
conventions

• fill_color sets the color that fills the globe, and is None by default. It follows
matplotlib conventions

• zorder sets the layer position. By default, the order is set by Basemap

PROGRAM

from mpl_toolkits.basemap import Basemap
import matplotlib.pyplot as plt

plt.figure(0)

map= Basemap(projection='ortho',lon_0=0,lat_0=0,resolution='c')

map.drawmapboundary()

plt.figure(1)

map= Basemap(projection='sinu',lon_0=0,resolution='c')

map.drawmapboundary(fill_color='aqua')

plt.show()
Orthographic projection result

Sinusoidal Projection result

Drawstates

Draws the state borders of the countries of the Americas from the layer included with the library.
It also draws the Australian states.

drawstates(linewidth=0.5, linestyle='solid', color='k', antialiased=1, ax=None,
zorder=None)

• linewidth sets the line width in points

• linestyle sets the line type. By default it is solid, but it can be dashed, or any matplotlib
option

• color is k (black) by default. It follows matplotlib conventions

• antialiased is true by default

• zorder sets the layer position. By default, the order is set by Basemap

Note that:

• The resolution is fixed, and does not depend on the resolution parameter passed to the
class constructor

• The country border is not drawn, creating a strange effect if the method is not
combined with drawcountries

PROGRAM

from mpl_toolkits.basemap import Basemap
import matplotlib.pyplot as plt

map = Basemap(width=12000000, height=9000000,
              rsphere=(6378137.00, 6356752.3142),
              resolution='l', area_thresh=1000., projection='lcc',
              lat_1=45., lat_2=55, lat_0=50, lon_0=-107.)

map.drawmapboundary(fill_color='aqua')

map.fillcontinents(color='#ddaa66', lake_color='aqua')

map.drawcountries()

map.drawstates(color='0.5')

plt.show()
Etopo

Plots a relief image called etopo taken from the NOAA. The image has a 1'' arc
resolution, so when zooming in, the results are quite poor.

etopo(ax=None, scale=None, **kwargs)

The scale is useful to downgrade the original image resolution to speed up the process. A
value of 0.5 will divide the size of the image by 4

The image is warped to the final projection, so all projections work properly with this
method

PROGRAM

from mpl_toolkits.basemap import Basemap
import matplotlib.pyplot as plt

map = Basemap(llcrnrlon=-10.5,llcrnrlat=33,urcrnrlon=10.,urcrnrlat=46.,

resolution='i', projection='cass', lat_0 = 39.5, lon_0 = 0.)

map.etopo()

map.drawcoastlines()

plt.show()
Fill continents

Draws filled polygons with the continents

fillcontinents(color='0.8', lake_color=None, ax=None, zorder=None, alpha=None)

• color sets the continent color. By default it is a grey color. Any matplotlib
color specification can be used

• lake_color sets the color of the lakes. By default they are not drawn, but you may set
it to aqua to plot them blue

• alpha is a value from 0 to 1 to set the transparency

• zorder sets the position of the layer relative to others. It can be used to hide (or show)
a contourf layer that should appear only on the sea, for instance
PROGRAM

from mpl_toolkits.basemap import Basemap
import matplotlib.pyplot as plt

map = Basemap(projection='ortho',
              lat_0=0, lon_0=0)

# Fill the globe with a blue color
map.drawmapboundary(fill_color='aqua')

# Fill the continents with the land color
map.fillcontinents(color='coral',lake_color='aqua')

map.drawcoastlines()

plt.show()
Shadedrelief

Plots a shaded relief image. The image comes from the www.shadedrelief.com web page. The
original image size is 10800x5400

shadedrelief(ax=None, scale=None, **kwargs)

• The scale is useful to downgrade the original image resolution to speed up the
process. A value of 0.5 will divide the size of the image by 4. The original size is
quite big, 10800x5400 pixels

• The image is warped to the final projection, so all projections work properly with
this method
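The statement that a scale of 0.5 divides the image size by 4 follows from scaling both sides of the image. The arithmetic can be checked with a short sketch; the helper scaled_size is made up for this illustration and does not claim to reproduce how Basemap rounds the dimensions internally.

```python
def scaled_size(width, height, scale):
    """Pixel dimensions after multiplying both sides by `scale`."""
    return int(width * scale), int(height * scale)


full_w, full_h = 10800, 5400          # original size quoted in the text
half_w, half_h = scaled_size(full_w, full_h, 0.5)

# Ratio of pixel counts before and after scaling
ratio = (full_w * full_h) / (half_w * half_h)

print(half_w, half_h, ratio)
```

Halving each side leaves a quarter of the pixels, so the image to be warped is four times smaller.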

PROGRAM

from mpl_toolkits.basemap import Basemap

import matplotlib.pyplot as plt

map = Basemap(llcrnrlon=-10.5,llcrnrlat=33,urcrnrlon=10.,urcrnrlat=46.,

resolution='i', projection='cass', lat_0 = 39.5, lon_0 = 0.)

map.shadedrelief()

map.drawcoastlines()

plt.show()
Warpimage

Displays an image as a background.

warpimage(image='bluemarble', scale=None, **kwargs)

• By default, displays the NASA Bluemarble image

• The image must be in latlon projection, so the x size must be double the y size

• The image must cover the whole world, with the longitude starting at -180

PROGRAM

from mpl_toolkits.basemap import Basemap
import matplotlib.pyplot as plt
from PIL import Image

map = Basemap(projection='ortho', lat_0=0, lon_0=0)

tmpdir = '/tmp'

# resize the sample image to a 2:1 width-to-height ratio before warping
size = [600, 300]
im = Image.open("../sample_files/by.png")
im2 = im.resize(size, Image.LANCZOS)  # ANTIALIAS was removed in Pillow 10
im2.save(tmpdir+'/resized.png', "PNG")

map.warpimage(tmpdir+'/resized.png')

map.drawcoastlines()

plt.show()
Result

Thus the various plots using Basemap have been drawn successfully.
