0% found this document useful (0 votes)

61 views29 pages

Data Analysis and Visualization Using Python Libraries and Streamlit - RTF Pre Read Materials

Data Analysis using Python

Uploaded by

megha16

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

61 views29 pages

Data Analysis and Visualization Using Python Libraries and Streamlit - RTF Pre Read Materials

Data Analysis using Python

Uploaded by

megha16

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 29

Introduction to NumPy, Pandas and Matplotlib

Data Analysis
Data Analysis is a process of inspecting, cleaning, transforming, and modeling data
with the goal of discovering useful information, suggesting conclusions, and
supporting decision-making.
Stpes for Data Analysis, Data Manipulation and Data Visualization:
Tranform Raw Data in a Desired Format
Clean the Transformed Data (Step 1 and 2 also called as a Pre-processing of Data)
Prepare a Model
Analyse Trends and Make Decisions

NumPy

NumPy is a package for scientific computing.

Multi dimensional array
Methods for processing arrays
Element by element operations
Mathematical operations like logical, Fourier transform, shape manipulation,
linear algebra and random number generation
In [1]:
import numpy as np
Ndarray - NumPy Array
The ndarray is a multi-dimensional array object consisting of two parts -- the
actual data, some metadata which describes the stored data. They are indexed
just like sequence are in Python, starting from 0
Each element in ndarray is an object of data-type object called dtype
An item extracted from ndarray, is represented by a Python object of an array
scalar type
Single Dimensional Array
Creating a Numpy Array
In [2]:
# Creating a single-dimensional array
a = np.array([1,2,3]) # Calling the array function
print(a)
[1 2 3]
In [3]:
# Creating a multi-dimensional array
# Each set of elements within a square bracket indicates a row
# Array of two rows and two columns
b = np.array([[1,2], [3,4]])
print(b)
[[1 2]
[3 4]]
In [4]:
# Creating an ndarray by wrapping a list
list1 = [1,2,3,4,5] # Creating a list
arr = np.array(list1) # Wrapping the list
print(arr)
[1 2 3 4 5]
In [5]:
# Creating an array of numbers of a specified range
arr1 = np.arange(10, 100) # Array of numbers from 10 up to and excludin
g 100
print(arr1)
[10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 3
1 32 33
34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 5
5 56 57
58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 7
9 80 81
82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99]
In [6]:
# Creating a 5x5 array of zeroes
arr2 = np.zeros((5,5))
print(arr2)
[[0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0.]
[0. 0. 0. 0. 0.]]
In [7]:
# Creating a linearly spaced vector, with spacing
vector = np.linspace(0, 20, 5) # Start, stop, step
print(vector)
[ 0. 5. 10. 15. 20.]
In [8]:
# Creating Arrays from Existing Data
x = [1,2,3]
# Used for converting Python sequences into ndarrays
c = np.asarray(x) #np.asarray(a, dtype = None, order = None)
print(c)
[1 2 3]
In [9]:
# Converting a linear array of 8 elements into a 2x2x2 3D array
arr3 = np.zeros(8) # Flat array of eight zeroes
arr3d = arr3.reshape((2,2,2)) # Restructured array
print(arr3d)
[[[0. 0.]
[0. 0.]]

[[0. 0.]
[0. 0.]]]
In [10]:
# Flatten rgw 3d array to get back the linear array
arr4 = arr3d.ravel()
print(arr4)
[0. 0. 0. 0. 0. 0. 0. 0.]
Indexing of NumPy Arrays
In [11]:
# NumPy array indexing is identical to Python's indexing scheme
arr5 = np.arange(2, 20)
element = arr5[6]
print(element)
8
In [12]:
# Python's concept of lists slicing is extended to NumPy.
# The slice object is constructed by providing start, stop, and step parame
ters to slice()
arr6 = np.arange(20)
arr_slice = slice(1, 10, 2) # Start, stop & step
element2 = arr6[6]
print(arr6[arr_slice])
[1 3 5 7 9]
In [13]:
# Slicing items beginning with a specified index
arr7 = np.arange(20)
print(arr7[2:])
[ 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19]
In [14]:
# Slicing items until a specified index
print(arr7[:15])
[ 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14]
In [15]:
# Extracting specific rows and columns using Slicing
d = np.array([[1,2,3], [3,4,5], [4,5,6]])
print(d[0:2, 0:2]) # Slice the first two rows and the first two columns
[[1 2]
[3 4]]
NumPy Array Attributes
In [16]:
print(d.shape) # Returns a tuple consisting of array dimensions
print(d.ndim) # Attribute returns the number of array dimensions
print(a.itemsize) # Returns the length of each element of array in bytes
(3, 3)
2
8
In [17]:
y = np.empty([3,2], dtype = int) # Creates an uninitialized array of specifie
d shape and dtype
print(y)
[[140468392404648 140468392404648]
[ 0 0]
[ 0 0]]
In [18]:
# Returns a new array of specified size, filled with zeros
z = np.zeros(5) # np.zeros(shape, dtype = float)
print(z)
[0. 0. 0. 0. 0.]
Reading & Writing from Files
In [19]:
# NumPy provides the option of importing data from files directly into nda
rray using the loadtxt function
# The savetxt function can be used to write data from an array into a te
xt file
#import os
#print(os.listdir('../input'))
arr_txt = np.loadtxt('../input/data_file.txt')
np.savetxt('newfilex.txt', arr_txt)
In [20]:
# NumPy arrays can be dumped into CSV files using the savetxt function
and the comma delimiter
# The genfromtxt function can be used to read data from a CSV file into
a NumPy array
arr_csv = np.genfromtxt('../input/Hurricanes.csv', delimiter = ',')
np.savetxt('newfilex.csv', arr_csv, delimiter = ',')
Pandas
Pandas is an open-source Python library providing efficient, easy-to-use data
structure and data analysis tools. The name Pandas is derived from "Panel Data" -
an Econometrics from Multidimensional Data. Pandas is well suited for many
different kinds of data:
Tabular data with heterogeneously-type columns.
Ordered and unordered time series data.
Arbitary matrix data with row and column labels.
Any other form observational/statistical data sets. The data actually need not be
labeled at all to be placed into a pandas data structure.
Pandas provides three data structure - all of which are build on top of the NumPy
array - all the data structures are value-mutable
Series (1D) - labeled, homogenous array of immutable size
DataFrames (2D) - labeled, heterogeneously typed, size-mutable tabular data
structures
Panels (3D) - Labeled, size-mutable array
In [21]:
import pandas as p
Series
A Series is a single-dimensional array structures that stores homogenous data i.e.,
data of a single type.
All the elements of a Series are value-mutable and size-immutable
Data can be of multiple data types such as ndarray, lists, constants, series, dict
etc.
Indexes must be unique, hashable and have the same length as data. Defaults to
np.arrange(n) if no index is passed.
Data type of each column; if none is mentioned, it will be inferred; automatically
Deep copies data, set to false as default
Creating a Series
In [22]:
# Creating an empty Series
series = pd.Series() # The Series() function creates a new Series
print(series)
Series([], dtype: float64)
In [23]:
# Creating a series from an ndarray
# Note that indexes are a assigned automatically if not specifies
arr = np.array([10,20,30,40,50])
series1 = pd.Series(arr)
print(series1)
0 10
1 20
2 30
3 40
4 50
dtype: int64
In [24]:
# Creating a series from a Python dict
# Note that the keys of the dictionary are used to assign indexes during
conversion
data = {'a':10, 'b':20, 'c':30}
series2 = pd.Series(data)
print(series2)
a 10
b 20
c 30
dtype: int64
In [25]:
# Retrieving a part of the series using slicing
print(series1[1:4])
1 20
2 30
3 40
dtype: int64
DataFrames
A DataFrame is a 2D data structure in which data is aligned in a tabular fashion
consisting of rows & columns
A DataFrame can be created using the following constructor -
pandas.DataFrame(data, index, dtype, copy)
Data can be of multiple data types such as ndarray, list, constants, series, dict etc.
Index Row and column labels of the dataframe; defaults to np.arrange(n) if no
index is passed
Data type of each column
Creates a deep copy of the data, set to false as default
Creating a DataFrame
In [26]:
# Converting a list into a DataFrame
list1 = [10, 20, 30, 40]
table = pd.DataFrame(list1)
print(table)
0
0 10
1 20
2 30
3 40
In [27]:
# Creating a DataFrame from a list of dictionaries
data = [{'a':1, 'b':2}, {'a':2, 'b':4, 'c':8}]
table1 = pd.DataFrame(data)
print(table1)
# NaN (not a number) is stored in areas where no data is provided
a b c
0 1 2 NaN
1 2 4 8.0
In [28]:
# Creating a DataFrame from a list of dictionaries and accompaying row i
ndices
table2 = pd.DataFrame(data, index = ['first', 'second'])
# Dict keys become column lables
print(table2)
a b c
first 1 2 NaN
second 2 4 8.0
In [29]:
# Converting a dictionary of series into a DataFrame
data1 = {'one':pd.Series([1,2,3], index = ['a', 'b', 'c']),
'two':pd.Series([1,2,3,4], index = ['a', 'b', 'c', 'd'])}
table3 = pd.DataFrame(data1)
print(table3)
# the resultant index is the union of all the series indexes passed
one two
a 1.0 1
b 2.0 2
c 3.0 3
d NaN 4
DataFrame - Addition & Deletion of Columns
In [30]:
# A new column can be added to a DataFrame when the data is passed
as a Series
table3['three'] = pd.Series([10,20,30], index = ['a', 'b', 'c'])
print(table3)
one two three
a 1.0 1 10.0
b 2.0 2 20.0
c 3.0 3 30.0
d NaN 4 NaN
In [31]:
# DataFrame columns can be deleted using the del() function
del table3['one']
print(table3)
two three
a 1 10.0
b 2 20.0
c 3 30.0
d 4 NaN
In [32]:
# DataFrame columns can be deleted using the pop() function
table3.pop('two')
print(table3)
three
a 10.0
b 20.0
c 30.0
d NaN
DataFrame - Addition & Deletion of Rows
In [33]:
# DataFrame rows can be selected by passing the row lable to the loc() f
unction
print(table3.loc['c'])
three 30.0
Name: c, dtype: float64
In [34]:
# Row selection can also be done using the row index
print(table3.iloc[2])
three 30.0
Name: c, dtype: float64
In [35]:
# The append() function can be used to add more rows to the DataFrame
data2 = {'one':pd.Series([1,2,3], index = ['a', 'b', 'c']),
'two':pd.Series([1,2,3,4], index = ['a', 'b', 'c', 'd'])}
table5 = pd.DataFrame(data2)
table5['three'] = pd.Series([10,20,30], index = ['a', 'b', 'c'])
row = pd.DataFrame([[11,13],[17,19]], columns = ['two', 'three'])
table6 = table5.append(row)
print(table6)
one three two
a 1.0 10.0 1
b 2.0 20.0 2
c 3.0 30.0 3
d NaN NaN 4
0 NaN 13.0 11
1 NaN 19.0 17
/opt/conda/lib/python3.6/site-packages/pandas/core/frame.py:6211: FutureWa
rning: Sorting because non-concatenation axis is not aligned. A future versi
on
of pandas will change to not sort by default.

To accept the future behavior, pass 'sort=False'.

To retain the current behavior and silence the warning, pass 'sort=True'.

sort=sort)
In [36]:
# The drop() function can be used to drop rows whose labels are provide
d
table7 = table6.drop('a')
print(table7)
one three two
b 2.0 20.0 2
c 3.0 30.0 3
d NaN NaN 4
0 NaN 13.0 11
1 NaN 19.0 17
Importing & Exporting Data
In [37]:
# Data can be loaded into DataFrames from input data stored in the CSV
format using the read_csv() function
table_csv = pd.read_csv('../input/Cars2015.csv')
In [38]:
# Data present in DataFrames can be written to a CSV file using the to_c
sv() function
# If the specified path doesn't exist, a file of the same name is automatic
ally created
table_csv.to_csv('newcars2015.csv')
In [39]:
# Data can be loaded into DataFrames from input data stored in the Exce
lsheet format using read_excel()
sheet = pd.read_excel('cars2015.xlsx')
In [40]:
# Data present in DataFrames can be written to a spreadsheet file using t
o_excel()
#If the specified path doesn't exist, a file of the same name is automatica
lly created
sheet.to_excel('newcars2015.xlsx')

Matplotlib

Matplotlib is a Python library that is specially designed for the development of

graphs, charts etc., in order to provide interactive data visualisation
Matplotlib is inspired from the MATLAB software and reproduces many of it's
features
In [41]:
# Import Matplotlib submodule for plotting
import matplotlib.pyplot as plt
Plotting in Matplotlib
In [42]:
plt.plot([1,2,3,4]) # List of vertical co-ordinates of the points plotted
plt.show() # Displays plot
# Implicit X-axis values from 0 to (N-1) where N is the length of the list
In [43]:
# We can specify the values for both axes
x = range(5) # Sequence of values for the x-axis
# X-axis values specified - [0,1,2,3,4]
plt.plot(x, [x1**2 for x1 in x]) # vertical co-ordinates of the points plotted:
y = x^2
plt.show()

In [44]:
# We can use NumPy to specify the values for both axes with greater pre
cision
x = np.arange(0, 5, 0.01)
plt.plot(x, [x1**2 for x1 in x]) # vertical co-ordinates of the points plotted:
y = x^2
plt.show()

Multiline Plots
In [45]:
# Multiple functions can be drawn on the same plot
x = range(5)
plt.plot(x, [x1 for x1 in x])
plt.plot(x, [x1*x1 for x1 in x])
plt.plot(x, [x1*x1*x1 for x1 in x])
plt.show()

In [46]:
# Different colours are used for different lines
x = range(5)
plt.plot(x, [x1 for x1 in x],
x, [x1*x1 for x1 in x],
x, [x1*x1*x1 for x1 in x])
plt.show()

Grids
In [47]:
# The grid() function adds a grid to the plot
# grid() takes a single Boolean parameter
# grid appears in the background of the plot
x = range(5)
plt.plot(x, [x1 for x1 in x],
x, [x1*2 for x1 in x],
x, [x1*4 for x1 in x])
plt.grid(True)
plt.show()

Limiting the Axes

In [48]:
# The scale of the plot can be set using axis()
x = range(5)
plt.plot(x, [x1 for x1 in x],
x, [x1*2 for x1 in x],
x, [x1*4 for x1 in x])
plt.grid(True)
plt.axis([-1, 5, -1, 10]) # Sets new axes limits
plt.show()

In [49]:
# The scale of the plot can also be set using xlim() and ylim()
x = range(5)
plt.plot(x, [x1 for x1 in x],
x, [x1*2 for x1 in x],
x, [x1*4 for x1 in x])
plt.grid(True)
plt.xlim(-1, 5)
plt.ylim(-1, 10)
plt.show()

Adding Labels
In [50]:
# Labels can be added to the axes of the plot
x = range(5)
plt.plot(x, [x1 for x1 in x],
x, [x1*2 for x1 in x],
x, [x1*4 for x1 in x])
plt.grid(True)
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.show()

Adding the Title

In [51]:
# The title defines the data plotted on the graph
x = range(5)
plt.plot(x, [x1 for x1 in x],
x, [x1*2 for x1 in x],
x, [x1*4 for x1 in x])
plt.grid(True)
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.title("Polynomial Graph") # Pass the title as a parameter to title()
plt.show()

Adding a Legend
In [52]:
# Legends explain the meaning of each line in the graph
x = np.arange(5)
plt.plot(x, x, label = 'linear')
plt.plot(x, x*x, label = 'square')
plt.plot(x, x*x*x, label = 'cube')
plt.grid(True)
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.title("Polynomial Graph")
plt.legend()
plt.show()

Adding a Markers
In [53]:
x = [1, 2, 3, 4, 5, 6]
y = [11, 22, 33, 44, 55, 66]
plt.plot(x, y, 'bo')
for i in range(len(x)):
x_cord = x[i]
y_cord = y[i]
plt.text(x_cord, y_cord, (x_cord, y_cord), fontsize = 10)
plt.show()

Saving Plots
In [54]:
# Plots can be saved using savefig()
x = np.arange(5)
plt.plot(x, x, label = 'linear')
plt.plot(x, x*x, label = 'square')
plt.plot(x, x*x*x, label = 'cube')
plt.grid(True)
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.title("Polynomial Graph")
plt.legend()
plt.savefig('plot.png') # Saves an image names 'plot.png' in the current dire
ctory
plt.show()

Plot Types
Matplotlib provides many types of plot formats for visualising information
Scatter Plot
Histogram
Bar Graph
Pie Chart
Histogram
In [55]:
# Histograms display the distribution of a variable over a range of freque
ncies or values
y = np.random.randn(100, 100) # 100x100 array of a Gaussian distribution
plt.hist(y) # Function to plot the histogram takes the dataset as the para
meter
plt.show()

In [56]:
# Histogram groups values into non-overlapping categories called bins
# Default bin value of the histogram plot is 10
y = np.random.randn(1000)
plt.hist(y, 100)
plt.show()

Bar Chart
In [57]:
# Bar charts are used to visually compare two or more values using recta
ngular bars
# Default width of each bar is 0.8 units
# [1,2,3] Mid-point of the lower face of every bar
# [1,4,9] Heights of the successive bars in the plot
plt.bar([1,2,3], [1,4,9])
plt.show()

In [58]:
dictionary = {'A':25, 'B':70, 'C':55, 'D':90}
for i, key in enumerate(dictionary):
plt.bar(i, dictionary[key]) # Each key-value pair is plotted individually a
s dictionaries are not iterable
plt.show()

In [59]:
dictionary = {'A':25, 'B':70, 'C':55, 'D':90}
for i, key in enumerate(dictionary):
plt.bar(i, dictionary[key])
plt.xticks(np.arange(len(dictionary)), dictionary.keys()) # Adds the keys as lab
els on the x-axis
plt.show()
Pie Chart
In [60]:
plt.figure(figsize = (3,3)) # Size of the plot in inches
x = [40, 20, 5] # Proportions of the sectors
labels = ['Bikes', 'Cars', 'Buses']
plt.pie(x, labels = labels)
plt.show()

Scatter Plot
In [61]:
# Scatter plots display values for two sets of data, visualised as a collecti
on of points
# Two Gaussion distribution plotted
x = np.random.rand(1000)
y = np.random.rand(1000)
plt.scatter(x, y)
plt.show()

Styling
In [62]:
# Matplotlib allows to choose custom colours for plots
y = np.arange(1, 3)
plt.plot(y, 'y') # Specifying line colours
plt.plot(y+5, 'm')
plt.plot(y+10, 'c')
plt.show()

Color code:
b = Blue
c = Cyan
g = Green
k = Black
m = Magenta
r = Red
w = White
y = Yellow
In [63]:
# Matplotlib allows different line styles for plots
y = np.arange(1, 100)
plt.plot(y, '--', y*5, '-.', y*10, ':')
plt.show()
# - Solid line
# -- Dashed line
# -. Dash-Dot line
# : Dotted Line

In [64]:
linkcode
# Matplotlib provides customization options for markers
y = np.arange(1, 3, 0.2)
plt.plot(y, '*',
y+0.5, 'o',
y+1, 'D',
y+2, '^',
y+3, 's') # Specifying line styling
plt.show()

Streamlit
Install Streamlit
There are multiple ways to set up your development environment and install
Streamlit. Read below to understand these options. Developing locally with
Python installed on your own computer is the most common scenario.

Summary for experts

Set up your Python development environment.
Run:
pip install streamlit
Validate the installation by running our Hello app:
streamlit hello
Jump to our Basic concepts.
Installation steps for the rest of us
Option 1: I'm comfortable with the command line
Install Streamlit on your own machine using tools like venv and pip.

Option 2: I prefer a graphical interface

Install Streamlit using the Anaconda Distribution graphical user interface. This is
also the best approach if you're on Windows or don't have Python set up.

Option 3: I'd rather use a cloud-based environment

Use Streamlit Community Cloud with GitHub Codespaces so you don't have to go
through the trouble of installing Python and setting up an environment.

Option 4: I need something secure, controlled, and in the cloud

Use Streamlit in Snowflake to code your apps in the cloud, right alongside your
data with role-based access controls.

Install Streamlit using command line

This page will walk you through creating an environment with venv and installing
Streamlit with pip. These are our recommended tools, but if you are familiar with
others you can use your favorite ones too. At the end, you'll build a simple "Hello
world" app and run it. If you prefer to have a graphical interface to manage your
Python environments, check out how to Install Streamlit using Anaconda
Distribution.

Prerequisites
As with any programming tool, in order to install Streamlit you first need to make
sure your computer is properly set up. More specifically, you’ll need:

Python

We support version 3.8 to 3.12.

A Python environment manager (recommended)

Environment managers create virtual environments to isolate Python package

installations between projects.

We recommend using virtual environments because installing or upgrading a

Python package may cause unintentional effects on another package. For a
detailed introduction to Python environments, check out Python Virtual
Environments: A Primer.

For this guide, we'll be using venv, which comes with Python.

A Python package manager

Package managers handle installing each of your Python packages, including
Streamlit.

For this guide, we'll be using pip, which comes with Python.

Only on MacOS: Xcode command line tools

Download Xcode command line tools using these instructions in order to let the
package manager install some of Streamlit's dependencies.

A code editor

Our favorite editor is VS Code, which is also what we use in all our tutorials.

Create an environment using venv

Open a terminal and navigate to your project folder.

cd myproject
In your terminal, type:

python -m venv .venv

A folder named ".venv" will appear in your project. This directory is where your
virtual environment and its dependencies are installed.

Activate your environment

In your terminal, activate your environment with one of the following commands,
depending on your operating system.

# Windows command prompt

.venv\Scripts\activate.bat

# Windows PowerShell
.venv\Scripts\Activate.ps1

# macOS and Linux

source .venv/bin/activate
Once activated, you will see your environment name in parentheses before your
prompt. "(.venv)"

Install Streamlit in your environment

In the terminal with your environment activated, type:

pip install streamlit

Test that the installation worked by launching the Streamlit Hello example app:

streamlit hello
If this doesn't work, use the long-form command:

python -m streamlit hello

Streamlit's Hello app should appear in a new tab in your web browser!
Built with Streamlit 🎈
Fullscreen
open_in_new
Close your terminal when you are done.

Create a "Hello World" app and run it

Create a file named app.py in your project folder.
import streamlit as st

st.write("Hello world")
Any time you want to use your new environment, you first need to go to your
project folder (where the .venv directory lives) and run the command to activate
it:
# Windows command prompt
.venv\Scripts\activate.bat

# Windows PowerShell
.venv\Scripts\Activate.ps1

# macOS and Linux

source .venv/bin/activate
Once activated, you will see your environment's name in parentheses at the
beginning of your terminal prompt. "(.venv)"
Run your Streamlit app.

streamlit run app.py

If this doesn't work, use the long-form command:

python -m streamlit run app.py

To stop the Streamlit server, press Ctrl+C in the terminal.

When you're done using this environment, return to your normal shell by typing:

deactivate
Install Streamlit using Anaconda Distribution
This page walks you through installing Streamlit locally using Anaconda
Distribution. At the end, you'll build a simple "Hello world" app and run it. You can
read more about Getting started with Anaconda Distribution in Anaconda's docs.
If you prefer to manage your Python environments via command line, check out
how to Install Streamlit using command line.

Prerequisites
A code editor

Anaconda Distribution includes Python and basically everything you need to get
started. The only thing left for you to choose is a code editor.
Our favorite editor is VS Code, which is also what we use in all our tutorials.

Knowledge about environment managers

Environment managers create virtual environments to isolate Python package

installations between projects. For a detailed introduction to Python
environments, check out Python Virtual Environments: A Primer.

But don't worry! In this guide we'll teach you how to install and use an
environment manager (Anaconda).

Install Anaconda Distribution

Go to anaconda.com/download.

Install Anaconda Distribution for your OS.

Create an environment using Anaconda Navigator

Open Anaconda Navigator (the graphical interface included with Anaconda
Distribution).

You can decline signing in to Anaconda if prompted.

In the left menu, click "Environments".

Open your environments list in Anaconda Navigator
At the bottom of your environments list, click "Create".
Click "Create" to open the Create new environment dialog

Enter "streamlitenv" for the name of your environment.

Click "Create."

Finalize your new conda environment

Activate your environment
Click the green play icon (play_circle) next to your environment.

Click "Open Terminal."

Open a new terminal with your environment activated

A terminal will open with your environment activated. Your environment's name
will appear in parentheses at the beginning of your terminal's prompt to show
that it's activated.

Install Streamlit in your environment

In your terminal, type:

pip install streamlit

To validate your installation, enter:

streamlit hello
If this doesn't work, use the long-form command:

python -m streamlit hello

The Streamlit Hello example app will automatically open in your browser. If it
doesn't, open your browser and go to the localhost address indicated in your
terminal, typically http://localhost:8501. Play around with the app!

Close your terminal.

Create a Hello World app and run it

Open VS Code with a new project.

Create a Python file named app.py in your project folder.

Create a new file called app.py

Copy the following code into app.py and save it.

import streamlit as st

st.write("Hello World")
Click your Python interpreter in the lower-right corner, then choose your
streamlitenv environment from the drop-down.
Set your Python interpreter to your streamlitenv environment

Right-click app.py in your file navigation and click "Open in integrated terminal".
Open your terminal in your project folder

A terminal will open with your environment activated. Confirm this by looking for
"(streamlitenv)" at the beginning of your next prompt. If it is not there, manually
activate your environment with the command:

conda activate streamlitenv

In your terminal, type:

streamlit run app.py

If this doesn't work, use the long-form command:

python -m streamlit run app.py

Start your Streamlit app with streamlit run app.py
Your app will automatically open in your browser. If it doesn't for any reason,
open your browser and go to the localhost address indicated in your terminal,
typically http://localhost:8501.

Change st.write to st.title and save your file:

import streamlit as st

st.title("Hello World")
In your browser, click "Always rerun" to instantly rerun your app whenever you
save a change to your file.
Automatically rerun your app when your source file changes

Your app will update! Keep making changes and you will see your changes as soon
as you save your file.
Your app updates when you resave your source file

When you're done, you can stop your app with Ctrl+C in your terminal or just by
closing your terminal.

Pandas
No ratings yet
Pandas
163 pages
Print
No ratings yet
Print
296 pages
Pandas Class XII (2021-22)
No ratings yet
Pandas Class XII (2021-22)
246 pages
Python Unit - 6 Pandas
No ratings yet
Python Unit - 6 Pandas
106 pages
Introduction To Numpy: Aniruddh Kadam Reg No-12109237 Lovely Professional University
100% (1)
Introduction To Numpy: Aniruddh Kadam Reg No-12109237 Lovely Professional University
84 pages
Ch-2 Python Libraries For ML
No ratings yet
Ch-2 Python Libraries For ML
70 pages
PP&DS 3
No ratings yet
PP&DS 3
109 pages
Numpy Basics Introduction To
No ratings yet
Numpy Basics Introduction To
35 pages
Unit 1 Machine Learning
No ratings yet
Unit 1 Machine Learning
36 pages
DAY6 Pandas Seaborn
No ratings yet
DAY6 Pandas Seaborn
97 pages
Swarang Raut EDVA Experiment 1 Numpy Pandas
No ratings yet
Swarang Raut EDVA Experiment 1 Numpy Pandas
58 pages
M3-Introduction To Numpy and Pandas
No ratings yet
M3-Introduction To Numpy and Pandas
55 pages
Module Numpy
No ratings yet
Module Numpy
67 pages
Python Libraries
No ratings yet
Python Libraries
79 pages
Ilovepdf Merged (2) Merged
No ratings yet
Ilovepdf Merged (2) Merged
65 pages
Unit 1 Machine Learning
No ratings yet
Unit 1 Machine Learning
61 pages
NumPy - The Absolute Basics For Beginners - NumPy V2.4.dev0 Manual
No ratings yet
NumPy - The Absolute Basics For Beginners - NumPy V2.4.dev0 Manual
41 pages
NUMPY
No ratings yet
NUMPY
33 pages
Python Module 5
No ratings yet
Python Module 5
43 pages
4 Introduction To Python Part 3
No ratings yet
4 Introduction To Python Part 3
62 pages
Working With Pandas Notes
No ratings yet
Working With Pandas Notes
27 pages
Maintenance Manual For Brake of Geared Traction Machine: - 1-D55006-C Issued in March 2021
100% (1)
Maintenance Manual For Brake of Geared Traction Machine: - 1-D55006-C Issued in March 2021
68 pages
Unit 5
No ratings yet
Unit 5
40 pages
Unit 3
No ratings yet
Unit 3
42 pages
Fundamentals of Data Science Lab Manual
No ratings yet
Fundamentals of Data Science Lab Manual
34 pages
Numpy Basics
No ratings yet
Numpy Basics
66 pages
4 Introduction To Python Part 3
No ratings yet
4 Introduction To Python Part 3
48 pages
Numpy
No ratings yet
Numpy
44 pages
Week 4 - Introduction To Python #3
No ratings yet
Week 4 - Introduction To Python #3
47 pages
45B AIML Practical1.1
No ratings yet
45B AIML Practical1.1
57 pages
Data Visualization1
No ratings yet
Data Visualization1
52 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
42 pages
Numpy and Pandas
No ratings yet
Numpy and Pandas
28 pages
Section 7
No ratings yet
Section 7
33 pages
Num Py
No ratings yet
Num Py
31 pages
Basic Array Creation and Operations
No ratings yet
Basic Array Creation and Operations
27 pages
De Lab Manual New
No ratings yet
De Lab Manual New
24 pages
Lesson 2: Cultural and Sociopolitical Evolution
No ratings yet
Lesson 2: Cultural and Sociopolitical Evolution
7 pages
Fods Lab Manual
No ratings yet
Fods Lab Manual
26 pages
Numpy Tutorial
No ratings yet
Numpy Tutorial
19 pages
Num Py
No ratings yet
Num Py
21 pages
XII - Ip - Panda - I - Part - I - 2023 (1) 1 1
No ratings yet
XII - Ip - Panda - I - Part - I - 2023 (1) 1 1
25 pages
Lets Begin With Numpy
No ratings yet
Lets Begin With Numpy
16 pages
Num Py
No ratings yet
Num Py
18 pages
RAW Data
No ratings yet
RAW Data
22 pages
python-notes-BCC-302 (Unit - 05)
No ratings yet
python-notes-BCC-302 (Unit - 05)
25 pages
Introduction To Numpy Pandas and Matplotlib
No ratings yet
Introduction To Numpy Pandas and Matplotlib
2 pages
PipeFlow2Multi phaseFlowAssurance
100% (2)
PipeFlow2Multi phaseFlowAssurance
373 pages
Numpy Cheat Sheet
No ratings yet
Numpy Cheat Sheet
13 pages
05-Unit-V Python Lecture Notes
No ratings yet
05-Unit-V Python Lecture Notes
14 pages
Lecture 2 - NumPy I
No ratings yet
Lecture 2 - NumPy I
11 pages
Lecture 2 - NumPy I
No ratings yet
Lecture 2 - NumPy I
12 pages
NumPy Class 11th
No ratings yet
NumPy Class 11th
10 pages
Numpy & Pandas
No ratings yet
Numpy & Pandas
13 pages
DV Lab2 Updated
No ratings yet
DV Lab2 Updated
12 pages
New War Fronts Lie in Economic Zones Essay
No ratings yet
New War Fronts Lie in Economic Zones Essay
7 pages
Numpy Matplot
No ratings yet
Numpy Matplot
14 pages
Tutorial 2
No ratings yet
Tutorial 2
9 pages
L-1 (Introduction To Numpy & Panda) - Colab
No ratings yet
L-1 (Introduction To Numpy & Panda) - Colab
7 pages
SS 497 - Code of Practice For Design, Safe Use and Maintenance of Gantry Cranes
75% (4)
SS 497 - Code of Practice For Design, Safe Use and Maintenance of Gantry Cranes
20 pages
FDS Exp1,2
No ratings yet
FDS Exp1,2
4 pages
Political Economy 1st Edition Sarah. Comyn 2025 Scribd Download
No ratings yet
Political Economy 1st Edition Sarah. Comyn 2025 Scribd Download
77 pages
Module 4
No ratings yet
Module 4
4 pages
Lecture Notes On Cement
No ratings yet
Lecture Notes On Cement
58 pages
Lestronic II Battery Charger Owner Manual
No ratings yet
Lestronic II Battery Charger Owner Manual
4 pages
The Perfect Heist
No ratings yet
The Perfect Heist
107 pages
Untitled 8
No ratings yet
Untitled 8
2 pages
Latestlog
No ratings yet
Latestlog
34 pages
1 - Numpy
No ratings yet
1 - Numpy
1 page
F900got Connection 1 of 6
No ratings yet
F900got Connection 1 of 6
102 pages
CI Manual en
100% (1)
CI Manual en
63 pages
Labor Law Practice in ACI LTD
0% (1)
Labor Law Practice in ACI LTD
12 pages
2nd Law Labster - Simulation - Activity - On - Newton - S - Second - Law - of - Motion - Speed - and - Acceleration - Guillermo PDF
100% (1)
2nd Law Labster - Simulation - Activity - On - Newton - S - Second - Law - of - Motion - Speed - and - Acceleration - Guillermo PDF
3 pages
Esr-3807 23
No ratings yet
Esr-3807 23
28 pages
Creep Relaxation of A Gasket Material: Standard Test Methods For
No ratings yet
Creep Relaxation of A Gasket Material: Standard Test Methods For
6 pages
National Logistics Policy
No ratings yet
National Logistics Policy
4 pages
Vss 2600 Valetplus Web
No ratings yet
Vss 2600 Valetplus Web
2 pages
Class History Snhs 2020 2021
No ratings yet
Class History Snhs 2020 2021
3 pages
Quiz
No ratings yet
Quiz
3 pages
Entering A Trade at The Right Time
No ratings yet
Entering A Trade at The Right Time
5 pages
A Forensic Tale of Nepal
No ratings yet
A Forensic Tale of Nepal
3 pages
Movement Disorders
No ratings yet
Movement Disorders
4 pages
Lesson Plan Writing Character
No ratings yet
Lesson Plan Writing Character
3 pages
Controlling Blood Glucose 1
No ratings yet
Controlling Blood Glucose 1
2 pages
IPIndianJClinExpDermatol 9-3-142 146
No ratings yet
IPIndianJClinExpDermatol 9-3-142 146
5 pages
Whrb-Steam Blowing-New
No ratings yet
Whrb-Steam Blowing-New
3 pages
v10 New Product
No ratings yet
v10 New Product
6 pages
PETA LOKASI PENELITIAN PLG
No ratings yet
PETA LOKASI PENELITIAN PLG
1 page
Kindness Matters Font
No ratings yet
Kindness Matters Font
1 page
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet

Data Analysis and Visualization Using Python Libraries and Streamlit - RTF Pre Read Materials

Uploaded by

Data Analysis and Visualization Using Python Libraries and Streamlit - RTF Pre Read Materials

Uploaded by

Introduction to NumPy, Pandas and Matplotlib

NumPy is a package for scientific computing.

To accept the future behavior, pass 'sort=False'.

Matplotlib is a Python library that is specially designed for the development of

Limiting the Axes

Adding the Title

Summary for experts

Option 2: I prefer a graphical interface

Option 3: I'd rather use a cloud-based environment

Option 4: I need something secure, controlled, and in the cloud

Install Streamlit using command line

We support version 3.8 to 3.12.

A Python environment manager (recommended)

Environment managers create virtual environments to isolate Python package

We recommend using virtual environments because installing or upgrading a

A Python package manager

Only on MacOS: Xcode command line tools

Create an environment using venv

python -m venv .venv

Activate your environment

# Windows command prompt

# macOS and Linux

Install Streamlit in your environment

pip install streamlit

python -m streamlit hello

Create a "Hello World" app and run it

# macOS and Linux

streamlit run app.py

python -m streamlit run app.py

Knowledge about environment managers

Environment managers create virtual environments to isolate Python package

Install Anaconda Distribution

Install Anaconda Distribution for your OS.

Create an environment using Anaconda Navigator

You can decline signing in to Anaconda if prompted.

In the left menu, click "Environments".

Enter "streamlitenv" for the name of your environment.

Finalize your new conda environment

Click "Open Terminal."

Install Streamlit in your environment

pip install streamlit

python -m streamlit hello

Close your terminal.

Create a Hello World app and run it

Create a Python file named app.py in your project folder.

Copy the following code into app.py and save it.

conda activate streamlitenv

streamlit run app.py

python -m streamlit run app.py

Change st.write to st.title and save your file:

You might also like